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TITLE OF THE INVENTION 

HEPATITIS C VIRUS REPLICONS AND REPUCON ENHANCED CELLS 

CROSS-REFERENCE TO RELATED APPLICATIONS 
5 The present application claims priority to U.S. Serial No. 60/263,479, 

filed January 23, 2001, hereby incorporated by reference herein. 

BACKGROUND OF THE INVENTION 

The references cited in the present application are not admitted to be 

10 prior art to the claimed invention. 

It is estimated that about 3% of the world's population are infected 
with the Hepatitis C virus (HCV). (Wasley, et al, 2000. Semin. Liver Dis. 20, 1-16.) 
Exposure to HCV results in an overt acute disease in a small percentage of cases, 
while in most instances the virus establishes a chronic infection causing liver 

15 inflammation and slowly progresses into liver failure and cirrhosis. (Iwarson, 1994. 
FEMS Microbiol. Rev. 14, 201-204.) In addition, epidemiological surveys indicate an 
important role of HCV in the pathogenesis of hepatocellular carcinoma. (Kew, 1994. 
FEMS Microbiol. Rev. 14, 211-220, Alter, 1995. Blood 85, 1681-1695.) 

The HCV genome consists of a single strand RNA of about 9.5 kb in 

20 length, encoding a precursor polyprotein of about 3000 amino acids. (Choo, et ai, 
1989. Science 244, 362-364, Choo, et al. r 1989. Science 244, 359-362, Takamizawa, 
et al. p 1991. J. Virol. 65, 1 105-1 1 13.) The HCV polyprotein contains the viral 
proteins in the order: C-El-E2-p7-NS2-NS3-NS4A-NS4B-NS5A-NS5B. 

Individual viral proteins are produced by proteolysis of the HCV 

25 polyprotein. Host cell proteases release the putative structural proteins C, El, E2, and 
p7, and create the N-terminus of NS2 at amino acid 810. (Mizushima, et al. t 1994. J. 
Virol. 68, 2731-2734, Hijikata, etal, 1993. P.N.A.S. USA 90, 10773-10777.) 

The non-structural proteins NS3, NS4A, NS4B, NS5A and NS5B 
presumably form the virus replication machinery and are released from the 

30 polyprotein. A zinc-dependent protease associated with NS2 and the N-terminus of 
NS3 is responsible for cleavage between NS2 and NS3. (Grakoui, et al. p 1993. 7. 
Virol 67, 1385-1395, Hijikata, et aL, 1993. P.N.A.S. USA 90, 10773-10777.) A 
distinct serine protease located in the N-terminal domain of NS3 is responsible for 
proteolytic cleavages at the NS3/NS4A, NS4A/NS4B, NS4B/NS5A and NS5A/NS5B 

35 junctions. (Barthenschlager, et at., 1993. J. Virol. 67, 3835-3844, Grakoui, et aL, 
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1993. Proc. Natl Acad. Sci. USA 90, 10583-10587, Tomei, et al, 1993. /. Virol. 67, 
4017-4026.) NS4A provides a cofactor for NS3 activity. (Failla, et al, J. Virol 1994. 
68, 3753-3760, De Francesco, et al, U.S. Patent No. 5,739,002.) NS5A is a highly 
phosphorylated protein concurring interferon resistance. (De Francesco, et al, 2000. 
5 Semin Liver Dis., 20(1), 69-83, Pawlotsky, 1999. J. Viral Hepat. Suppl. 1, 47-48.) 
NS5B provides an RNA polymerase. (De Francesco, et al, International Publication 
Number WO 96/37619, Behrens, et al, 1996. EMBO 15, 12-22, Lohmann, et al, 
1998. Virology 249, 108-118.) 

Lohmann, et al, Science 285, 1 10-1 13, 1999, illustrates the ability of a 
10 biscistronic HCV replicon to replicate in a hepatoma cell line. The biscistonic HCV 
replicon contained a neomycin cistron and an NS2-NS5B or an NS3-NS5B cistron. 
"NS2-NS5B" refers to a NS2-NS3-NS4A-NS4B-NS5A-NS5B polyprotein. "NS3- 
NS5B" refers to a NS3-NS4A-NS4B-NS5A-NS5B polyprotein. 

Bartenschlager, European Patent Application 1 043 399, published 
15 October 11, 2000 (not admitted to be prior art to the claimed invention), describes a 
cell culture system for autonomous HCV RNA replication and protein expression. 
Replication and protein expression is indicated to occur in sufficiently large amounts 
for quantitative determination. European Patent Application 1 043 399 indicates that 
prior cell lines or primary cell cultures infected with HCV do not provide favorable 
20 circumstances for detecting HCV replication. 

SUMMARY OF THE INVENTION 

The present invention features nucleic acid containing one or more 
adaptive mutations, and HCV replicon enhanced cells. Adaptive mutations are 

25 mutations that enhance HCV replicon activity. HCV replicon enhanced cells are cells 
having an increased ability to maintain an HCV replicon. 

An HCV replicon is an RNA molecule able to autonomously replicate 
in a cultured cell and produce detectable levels of one or more HCV proteins. The 
basic subunit of an HCV replicon encodes for a HCV NS3-NS5B polyprotein along 

30 with a suitable 5' UTR-partial core (PC) region and 3' UTR. The 5' UTR-PC region 
is made up of a 5'UTR region and about 36 nucleotides of the beginning of the core. 
Additional regions may be present including those coding for HCV proteins or 
elements such as the complete core, El, E2, p7 or NS2; and those coding for other 
types of proteins or elements such as a encephalomyocarditis virus (EMCV) internal 

35 ribosome entry site (IRES), a reporter protein or a selection protein. 
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The present application identifies different adaptive mutations that 
enhance HCV replicon activity. Enhancing replicon activity brings about at least one 
of the following: an increase in replicon maintenance in a cell, an increase in replicon 
replication, and an increase in replicon protein expression. 
5 Adaptive mutations are described herein by identifying the location of 

the adaptive mutation with respect to a reference sequence present in a particular 
region. Based on the provided reference sequence, the same adaptive mutation can be 
produced in corresponding locations of equivalent regions having an amino acid 
sequence different than the reference sequence. Equivalent regions have the same 

10 function or encode for a polypeptide having the same function. 

Replicon enhanced cells are a preferred host for the insertion and 
expression of an HCV replicon. Replicon enhanced cells are initially produced by 
creating a cell containing a HCV replicon and then curing the cell of the replicon. 
The term "replicon enhanced cell" includes cells cured of HCV replicons and progeny 

15 of such cells. 

Thus, a first aspect of the present invention describes a nucleic acid 
molecule comprising at least one of the following regions: an altered NS3 encoding 
region, an altered NS5A encoding region, and an altered EMCV IRES region. The 
altered region contains one or more adaptive mutations. Reference to the presence of 
20 particular adaptive mutation(s) does not exclude other mutations or adaptive 

mutations from being present. Adaptive mutations are described with reference to 
either an encoded amino acid sequence or a nucleic acid sequence. 

A nucleic acid molecule can be single-stranded or part of a double 
strand, and can be RNA or DNA. Depending upon the structure of the nucleic acid 
25 molecule, the molecule may be used as a replicon or in the production of a replicon. 
For example, single-stranded RNA having the proper regions can be a replicon, while 
double-stranded DNA that includes the complement of a sequence coding for a 
replicon or replicon intermediate may useful in the production of the replicon or 
replicon intermediate. 

30 Preferred nucleic acid molecules are those containing region(s) from 

SEQ. ID. NOs. 1, 2, or 3, or the RNA version thereof, with one or more adaptive 
mutations. Reference to "the RNA version thereof indicates a ribose backbone and 
the presence of uracil instead of thymine. 

The presence of a region containing an adaptive mutation indicates that 

35 at least one such region is present. In different embodiments, for example, adaptive 
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mutations described herein are present at least in the NS3 region, in the NS5A region, 
in the NS3 and NS5A regions, in the EMCV IRES and NS3 regions, in the EMCV 
and NS5A regions, and in the ECMV IRES, NS3 and NS5A regions. 

Another aspect of the present invention describes an expression vector 
5 comprising a nucleotide sequence of an HCV replicon or replicon intermediate 

coupled to an exogenous promoter. Reference to a nucleotide sequence "coupled to 
an exogenous promoter" indicates the presence and positioning of an RNA promoter 
such that it can mediate transcription of the nucleotide sequence and that the promoter 
is not naturally associated with the nucleotide sequence being transcribed. The 

10 expression vector can be used to produce RNA replicons. 

Another aspect of the present invention describes a recombinant 
human hepatoma cell. Reference to a recombinant cell includes an initially produced 
cell and progeny thereof. 

Another aspect of the present invention describes a method of making 

15 a HCV replicon enhanced cell. The method involves the steps of: (a) introducing and 
maintaining an HCV replicon into a cell and (b) curing the cell of the HCV replicon. 

Another aspect of the present invention describes an HCV replicon 
enhanced cell made by a process comprising the steps of: (a) introducing and 
maintaining an HCV replicon into a cell and (b) curing the cell of the HCV replicon. 

20 Another aspect of the present invention describes a method of making 

a HCV replicon enhanced cell comprising an HCV replicon. The method involves (a) 
introducing and maintaining a first HCV replicon into a cell, (b) curing the cell of the 
replicon, and (c) introducing and maintaining a second replicon into the cured cell, 
where the second replicon may be the same or different as the first replicon. 

25 Another aspect of the present invention describes an HCV replicon 

enhanced cell containing a HCV replicon made by the process involving the step of 
introducing an HCV replicon into an HCV replicon enhanced cell. The HCV replicon 
introduced into the HCV replicon enhanced cell may be the same or different than the 
HCV replicon used to produce the HCV replicon enhanced cell. In a preferred 

30 embodiment, the HCV replicon introduced into an HCV replicon enhanced cell is the 
same replicon as was used to produce the enhanced cell. 

Another aspect of the present invention describes a method of 
measuring the ability of a compound to affect HCV activity using an HCV replicon 
comprising an adaptive mutation described herein. The method involves providing a 

35 compound to a cell comprising the HCV replicon and measuring the ability of the 
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compound to affect one or more replicon activities as a measure of the effect on HCV 
activity. 

Another aspect of the present invention describes a method of 
measuring the ability of a compound to affect HCV activity using an HCV replicon 
5 enhanced cell that comprises an HCV replicon. The method involves providing a 
compound to the cell and measuring the ability of the compound to effect one or more 
replicon activities as a measure of the effect on HCV activity. 

Other features and advantages of the present invention are apparent 
from the additional descriptions provided herein including the different examples. 
10 The provided examples illustrate different components and methodology useful in 
practicing the present invention. The examples do not limit the claimed invention. 
Based on the present disclosure the skilled artisan can identify and employ other 
components and methodology useful for practicing the present invention. 

1 5 BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1 A-1G illustrate the nucleic acid sequence for the 
pHCVNeo.17 coding strand (SEQ. ID. NO. 3). The different regions of pHCVNeo. 17 
are provided as follows: 

1-341: HCV 5' non-translated region, drives translation of the core-neo fusion protein; 
20 342- 1181: Core-neo fusion protein, selectable marker; 

1 190-1800: Internal ribosome entry site of the encephalomyocarditis virus, drives 
translation of the HCV NS region; 

1801-7755: HCV polyprotein from non-structural protein 3 to non-structural protein 
5B; 

25 1801-3696: Non-structural protein 3 (NS3), HCV NS3 protease/helicase; 
3697-3858: Non-structural protein 4A (NS4A), NS3 protease cofactor; 
3859-4641: Non-structural protein 4B (NS4B); 
4642-5982: Non-structural protein 5A (NS5A); 

5983-7755: Non-structural protein 5B (NS5B); RNA-dependent RNA polymerase 
30 7759-7989: HCV 3' non-translated region; and 

7990-10690 plasmid sequences comprising origin of replication, beta lactamase 
coding sequence, and T7 promoter. 
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DETAILED DESCRIPTION OF THE INVENTION 

HCV replicons and HCV replicon enhanced cells can be used to 
produce a cell culture providing detectable levels of HCV RNA and HCV protein. 
HCV replicons and HCV replicon enhanced hosts can both be obtained by selecting 
5 for the ability to maintain an HCV replicon in a cell. As illustrated in the examples 
provided below, adaptive mutations present in HCV replicons and host cells can both 
assist replicon maintenance in a cell. 

The detectable replication and expression of HCV RNA in a cell 
culture system has a variety of different uses including being used to study HCV 
10 replication and expression, to study HCV and host cell interactions, to produce HCV 
RNA, to produce HCV proteins, and to provide a system for measuring the ability of a 
compound to modulate one or more HCV activities. 

Preferred cells for use with a HCV replicon are Huh-7 cells and Huh-7 
derived cells. "Huh-7 derived cells" are cell produced starting with Huh-7 cells and 
15 introducing one or more phenotypic and/or genotypic modifications. 

Adaptive Mutations 
Adaptive mutations enhance the ability of an HCV replicon to be 
maintained and expressed in a host cell. Adaptive mutations can be initially selected 
20 for using a wild type HCV RNA construct or a mutated HCV replicon. Initial 

selection involves providing HCV replicons to cells and identifying clones containing 
a replicon. 

Nucleic acid sequences of identified HCV replicons can be determined 
using standard sequencing techniques. Comparing the sequence of input HCV 

25 constructs and selected constructs provides the location of mutations. The effect of 
particular mutation(s) can be measured by, for example, producing a construct to 
contain particular mutation(s) and measuring the effect of these mutation(s). Suitable 
control constructs for comparison purposes include wild type constructs and 
constructs previously evaluated. 

30 Adaptive mutations were predominantly found in the HCV NS3 and 

NS5A regions. With the exception of two silent mutations in NS5A and NS5B, 
consensus mutations occurring in the NS region resulted in changes to the deduced 
amino acid sequence. Noticeably, the amino acid changes occurred in residues that 
are conserved in all or a large number of natural HCV isolates. HCV sequences are 

35 well known in the art and can be found, for example, in GenBank. 
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Adaptive mutations described herein can be identified with respect to a 
reference sequence. The reference sequence provides the location of the adaptive 
mutation in, for example, the NS3 or NS5A RNA, cDNA, or amino acid sequence. 
The remainder of the sequence encodes for a functional protein that may have the 
5 same, or a different, sequence than the reference sequence. 

Preferred NS3 and NS5A adaptive mutations and examples of changes 
that can be made to produce such mutations are shown in Tables 1 and 2. The amino 
acid numbering shown in Tables 1 and 2 is with respect to SEQ. ED. NO. 1. The 
nucleotide numbering shown in Tables 1 and 2 is with respect to SEQ. ED. NO. 2. 
10 SEQ. ID. NO. 1 provides the amino acid sequence of the Conl HCV isolate 

(Accession Number AJ238799). SEQ. ID. NO. 2 provides the nucleic acid sequence 
of the Conl HCV isolate. 

TABLE 1 

15 



Preferred NS3 Adaptive Mutations 



Amino Acid 


Nucleotide 


g]yl095ala 


G3625C 


glul202gly 


A3946G 


alal347thr 


G4380A 


TABLE 2 


Preferred NS5A Adaptive Mutations 


Amino Acid 


Nucleotide 


Lys@2039 


AAA@6458 


asn2041thr 


A6463C 


ser2173phe 


C6859T 


ser2197phe 


C6931T 


Ieu2198ser 


T6934C 


ala2199thr 


G6936A 


ser2204arg 


C6953A (orG) 


"@" refers to an addition. 
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Preferred adaptive mutations identified with respect to a reference 
sequence can be produced changing the encoding region of SEQ. ID. NO. 1, or an 
equivalent sequence, to result in the indicated change. Preferred adaptive mutations 
5 provided in Tables 1 and 2 occur in amino acids conserved among different HCV 
isolates. 

Adaptive mutations have different effects. Some mutations alone, or 

in combination with other mutations, enhance HCV replicon activity. In some cases, 

two or more mutations led to synergistic effects and in one case, a slightly 
10 antagonistic effect was observed. 

An adaptive mutation once identified can be introduced into a starting 

construct using standard genetic techniques. Examples of such techniques are 

provided by Ausubel, Current Protocols in Molecular Biology, John Wiley, 1987- 

1998, and Sambrook, et al. y Molecular Cloning, A Laboratory Manual, 2 nd Edition, 
15 Cold Spring Harbor Laboratory Press, 1989. 

HCV replicons containing adaptive mutations can be built around an 

NS3 region or NS5A region containing one or more adaptive mutations described 

herein. The final replicon will contain replicon components needed for replication 

and may contain additional components. 
20 SEQ. ED. NO. 2 can be used as a reference point for different HCV 

regions as follows: 

5' UTR- nucleotides 1-341; 

Core- nucleotides 342-914; 

El- nucleotides 915-1490; 
25 E2- nucleotides 1491-2579; 

P7- nucleotides 2580-2768; 

NS2- nucleotides 2769-3419; 

NS3- nucleotides 3420-5312; 

NS4A- nucleotides 5313-5474; 
30 NS4B- nucleotides 5475-6257; 

NS5A- nucleotides 6258-7598; 

NS5B- nucleotides 7599-9371; and 

3' UTR- nucleotides 9374-9605. 

The amino acid sequences of the different structural and non-structural regions is 
35 provided by SEQ. ID. NO. I. 
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Nucleic acid sequences encoding for a particular amino acid can be 
produced taking into account the degeneracy of the genetic code. The degeneracy of 
the genetic code arises because almost all amino acids are encoded for by different 
combinations of nucleotide triplets or "codons". The translation of a particular codon 
5 into a particular amino acid is well known in the art (see, e.g., Lewin GENES IV, p. 
1 19, Oxford University Press, 1990). Amino acids are encoded for by RNA codons as 
follows: 

A=Ala=Alanine: codons GCA, GCC, GCG, GCU 

C=Cys=Cysteine: codons UGC, UGU 
10 D=Asp=Aspartic acid: codons GAC, GAU 

E=Glu=Glutamic acid: codons GAA, GAG 

F=Phe=Phenylalanine: codons UUC, UUU 

G=Gly=Glycine: codons GGA, GGC, GGG, GGU 

H=His=Histidine: codons CAC, CAU 
15 I=Ile=Isoleucine: codons AUA, AUC, AUU 

K=Lys=Lysine: codons AAA, AAG 

L=Leu=Leucine: codons UUA, UUG, CUA, CUC, CUG, CUU 
M=Met=Methionine: codon AUG 
N=Asn=Asparagine: codons AAC, AAU 
20 P=Pro=Proline: codons CCA, CCC, CCG, CCU 
Q=Gln=GIutamine: codons CAA, CAG 

R=Arg=Arginine: codons AGA, AGG, CGA, CGC, CGG, CGU 

S=Ser=Serine: codons AGC, AGU, UCA, UCC, UCG, UCU 

T=Thr=Threonine: codons ACA, ACC, ACG, ACU 
25 V=Val=Valine: codons GUA, GUC, GUG, GUU 

W=Trp=Tryptophan: codon UGG 

Y=Tyr=Tyrosine: codons UAC, UAU. 

Constructs, including subgenomic and genomic replicons, containing 

one or more of the adaptive mutations described herein can also contain additional 
30 mutations. The additional mutations may be adaptive mutations and mutations not 

substantially inhibiting replicon activity. Mutations not substantially inhibiting 

replicon activity provide for a replicon that can be introduced into a cell and have 

detectable activity. 
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HCV Replicon 

HCV replicons include the full length HCV genome and subgenomic 
constructs. A basic HCV replicon is a subgenomic construct containing an HCV 5' 
UTR- PC region, an HCV NS3-NS5B polyprotein encoding region, and a HCV 3' 
5 UTR. Other nucleic acid regions can be present such as those providing for HCV 
NS2, structural HCV protein(s) and non-HCV sequences. 

The HCV 5' UTR-PC region provides an internal ribosome entry site 
(IRES) for protein translation and elements needed for replication. The HCV 5'UTR- 
PC region includes naturally occurring HCV 5' UTR extending about 36 nucleotides 
10 into a HCV core encoding region, and functional derivatives thereof. The 5'-UTR-PC 
region can be present in different locations such as site downstream from a sequence 
encoding a selection protein, a reporter, protein, or an HCV polyprotein. 

Functional derivatives of the 5'-UTR-PC region able to initiate 
translation and assist replication can be designed taking into structural requirements 
15 for HCV translation initiation. (See, for example, Honda, et al y 1996. Virology 222, 
31-42). The affect of different modifications to a 5' UTR-PC region can be 
determined using techniques that measure replicon activity. 

In addition to the HCV 5' UTR-PC region, non-HCV IRES elements 
can also be present in the replicon. The non-HCV IRES elements can be present in 
20 different locations including immediately upstream the region encoding for an HCV 
polyprotein. Examples of non-HCV IRES elements that can be used are the EMCV 
IRES, poliovirus IRES, and bovine viral diarrhea virus IRES. 

The HCV 3' UTR assists HCV replication. HCV 3' UTR includes 
naturally occurring HCV 3* UTR and functional derivatives thereof. Naturally 
25 occurring 3' UTR's include a poly U tract and an additional region of about 100 

nucleotides. (Tanaka, et ai, 1996.7. Virol 70, 3307-3312, Kolykhalov, etai, 1996. 
7. Virol. 70, 3363-3371.) At least in vivo, the 3' UTR appears to be essential for 
replication. (Kolykhalov, etai, 2000. 7. Virol. 2000 4, 2046-2051.) Examples of 
naturally occurring 3' UTR derivatives are described by Bartenschlager International 
30 Publication Number EP 1 043 399. 

The NS3-NS5B polyprotein encoding region provides for a polyprotein 
that can be processed in a cell into different proteins. Suitable NS3-NS5B polyprotein 
sequences that may be part of a replicon include those present in different HCV 
strains and functional equivalents thereof resulting in the processing of NS3-NS5B to 
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a produce functional replication machinery. Proper processing can be measured for 
by assaying, for example, NS5B RNA dependent RNA polymerase. 

The ability of an NS5B protein to provide RNA polymerase activity 
can be measured using techniques well known in the art. (See, for example, De 
5 Franscesco, et aL, International Publication Number WO 96/37619, Behrens, et al. f 
1996. EMBO 75:12-22, Lohmann,*/ al t 1998. Virology 249:108-1 18.) Preferably, 
the sequence of the active NS5B is substantially similar as that provided in SEQ. ED. 
NO. 1, or a wild type NS5B such as strains HCV-1, HCV-2, HCV-BK, HCV-J, HCV- 
N, HCV-H. A substantially similar sequence provides detectable HCV polymerase 
10 activity and contains 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or 15 amino acid 

alterations to that present in a HCV NS5B polymerase. Preferably, no more than 1, 2, 
3, 4 or 5 alterations are present. 

Alterations to an amino acid sequence provide for substitution(s), 
insertion(s), deletion(s) or a combination thereof. Sites of different alterations can be 
15 designed taking into account the amino acid sequences of different NS5B polymerases 
to identify conserved and variable amino acid, and can be empirically determined. 

HCV replicons can be produced in a wide variety of different cells and 
in vitro. Suitable cells allow for the transcription of a nucleic acid encoding for an 
HCV replicon. 

20 

Additional Sequences 
An HCV replicon may contain non-HCV sequences in addition to 
HCV sequences. The additional sequences should not prevent replication and 
expression, and preferably serve a useful function. Sequences that can be used to 
25 serve a useful function include a selection sequence, a reporter sequence, transcription 
elements and translation elements. 

Selection Sequence 

A selection sequence in an HCV replicon facilitates the identification 
30 of a cell containing the replicon. Selection sequences are typically used in 

conjunction with some selective pressure that inhibits growth of cells not containing 
the selection sequence. Examples of selection sequences include sequences encoding 
for antibiotic resistance and ribozymes. 

Antibiotic resistance can be used in conjunction with an antibiotic to 
35 select for cells containing replicons. Examples of selection sequences providing for 
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antibiotic resistance are sequences encoding resistance to neomycin, hygromycin, 
puromycin, or zeocin. 

A ribozyme serving as a selection sequence can be used in conjunction 
with an inhibitory nucleic acid molecule that prevents cellular growth. The ribozyme 
5 recognizes and cleaves the inhibitory nucleic acid. 

Reporter Sequence 

A reporter sequence can be used to detect replicon replication or 
protein expression. Preferred reporter proteins are enzymatic proteins whose presence 

10 can be detected by measuring product produced by the protein. Examples of reporter 
proteins include, luciferase, beta-lactamase, secretory alkaline phosphatase, beta- 
glucuronidase, green fluorescent protein and its derivatives. In addition, a reporter 
nucleic acid sequence can be used to provide a reference sequence that can be targeted 
by a complementary nucleic acid. Hybridization of the complementary nucleic acid to 

15 its target can be determined using standard techniques. 

Additional Sequence Configuration 

Additional non-HCV sequences are preferable 5' or 3' of an HCV 
replicon genome or subgenomic genome region. However, the additional sequences 
20 can be located within an HCV genome as long as the sequences do not prevent 

detectable replicon activity. If desired, additional sequences can be separated from 
the replicon by using a ribozyme recognition sequence in conjunction with a 
ribozyme. 

Additional sequences can be part of the same cistron as the HCV 
25 polyprotein or can be a separate cistron. If part of the same cistron, the selection or 
reporter sequence coding for a protein should result in a product that is either active as 
a chimeric protein or is cleaved inside a cell so it is separated from HCV protein. 

Selection and reporter sequences encoding for a protein when present 
as a separate cistron should be associated with elements needed for translation. Such 
30 elements include a 5' IRES. 

Detection Methods 
Methods for detecting replicon activity include those measuring the 
production or activity of replicon RNA and encoded for protein. Measuring includes 
35 qualitative and quantitative analysis. 
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Techniques suitable for measuring RNA production include those 
detecting the presence or activity of RNA. The presence of RNA can be detected 
using, for example, complementary hybridization probes or quantitative PCR. 
Techniques for measuring hybridization between complementary nucleic acid and 
5 quantitative PCR are well known in the art. (See for example, Ausubel, Current 

Protocols in Molecular Biology, John Wiley, 1987-1998, Sambrook, et al, Molecular 
Cloning, A Laboratory Manual, 2 nd Edition, Cold Spring Harbor Laboratory Press, 
1989, and U.S. Patent No. 5,731,148.) 

RNA enzymatic activity can be provided to the replicon by using a 
10 ribozyme sequence. Ribozyme activity can be measured using techniques detecting 
the ability of the ribozyme to cleave a target sequence. 

Techniques for measuring protein production include those detecting 
the presence or activity of a produced protein. The presence of a particular protein 
can be determined by, for example, immunological techniques. Protein activity can 
15 be measured based on the activity of an HCV protein or a reporter protein sequence. 

Techniques for measuring HCV protein activity vary depending upon 
the protein that is measured. Techniques for measuring the activity of different non- 
structural proteins such as NS2/3, NS3, and NS5B, are well known in the art. (See, 
for example, references provided in the Background of the Invention.) 
20 Assays measuring replicon activity also include those detecting virion 

production from a replicon that produces a virion; and those detecting a cytopathic 
effect from a replicon producing proteins exerting such an effect. Cytopathic effects 
can be detected by assays suitable to measure cell viability. 

Assays measuring replicon activity can be used to evaluate the ability 
25 of a compound to modulate HCV activities. Such assays can be carried out by 
providing one or more test compounds to a cell expressing an HCV replicon and 
measuring the effect of the compound on replicon activity. If a preparation containing 
more than one compound is found to modulate replicon activity, individual 
compounds or smaller groups of compounds can be tested to identify replicon active 
30 compounds. 

Compounds identified as inhibiting HCV activity can be used to 
produce replicon enhanced cells and may be therapeutic compounds. The ability of a 
compound to serve as a therapeutic compound can be confirmed using animal models 
such as a chimpanzee to measure efficacy and toxicity. 

35 
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Replicon Enhanced Host Cell 
Replicon enhanced cells are initially produced by selecting for a cell 
able to maintain an HCV replicon and then curing the cell of the replicon. Cells 
produced in this fashion were found to have an increased ability to maintain a replicon 
5 upon subsequent HCV replicon transfection. 

Initial transfection can be performed using a wild-type replicon or a 
replicon containing one or more adaptive mutations. If a wild-type replicon is 
employed, the replicon should contain a selection sequence to facilitate replicon 
maintenance. 

10 Cells can be cured of replicons using different techniques such as those 

employing replicon inhibitory agent. In addition, replication of HCV replicons is 
substantially reduced in confluent cells. Thus, it is conceivable to cure cells of 
replicons by culturing them at a high density. 

Replicon inhibitory agents inhibit replicon activity or select against a 

15 cell containing a replicon. An example of such an agent is IFN-a. Other HCV 
inhibitory compounds may also be employed. HCV inhibitor compounds are 
described, for example, in Llinas-Brunet, etai, 2000. Bioorg Med Chem. Lett. 10(20), 
2267-2270. 

The ability of a cured cell to be a replicon enhanced cell can be 
20 measured by introducing a replicon into the cell and determining efficiency of 
subsequent replicon maintenance and activity. 

EXAMPLES 

Examples are provided below to further illustrate different features of 
25 the present invention. The examples also illustrate useful methodology for practicing 
the invention. These examples do not limit the claimed invention. 

Example 1: Techniques 

This example illustrates the techniques employed for producing and 
30 analyzing adaptive mutations and replicon enhanced cells. 

Manipulation of Nucleic Acids and Construction of Recombinant Plasmids 

Manipulation of nucleic acids was done according to standard 
protocols. (Sambrook, et ai f 1989. Molecular Cloning: A Laboratory Manual, 2 nd ed. 
35 Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.) Plasmid DNA was 
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prepared from ON culture in LB broth using Qiagen 500 columns according to 

manufacturer instructions. 

Plasmids containing desired mutations were constructed by restriction 

digestion using restriction sites flanking the mutations or by PCR amplification of the 
5 area of interest, using synthetic oligonucleotides with the appropriate sequence. Site 

directed mutagenesis was carried out by inserting the mutations in the PCR primers. 

PCR amplification was performed using high fidelity thermostable polymerases or 

mixtures of polymerases containing a proofreading enzyme. (Barnes, et ai f 1994. 

Proc. Natl. Acad. Sci. 91, 2216-2220.) AH plasmids were verified by restriction 
1 0 mapping and sequencing. 

pHCVneol7.wt contains the cDNA for an HCV bicistronic replicon 

identical to replicon l3 7 7neo/NS3-37wt described by Bartenschlager (SEQ. ID. NO. 3) 

(Lohmann, et aL, 1999. Science 255,1 10-1 13, EMBL-genbank No. AJ242652). The 

plasmid comprises the following elements: 5' untranslated region of HCV comprising 
15 the HCV-IRES and part of the core (ntl-377); neomycin phosphotransferase coding 

sequence; and EMCV IRES; HCV coding sequences from NS3 to NS5B; 3' UTR of 

HCV. 

Plasmid pHCVNeol7.GAA is identical to pHCVNeo.17, except that 
the GAC triplets (nt. 6934-6939 of pHCVNeol7 sequence) coding for the catalytic 
20 aspartates of the NS5B polymerase (amino acids 2737 and 2738 of HCV polyprotein) 
were changed into GCG, coding for alanine. 

Plasmid pHCVNeol7.m0 is identical to pHCVNeol7, except that the 
triplet AGC (nt. 5335-5337 of pHCVNeol7 sequence) coding for the serine of NS5A 
protein (amino acid 2204 of HCV polyprotein) was changed into AGA, coding for 
25 arginine. 

Plasmid pHCVNeol7.ml is identical to pHCVNeol7, except that the 
triplet AAC (nt. 4846-4848 of pHCVNeol7 sequence) coding for the asparagine of 
NS5A protein (amino acid 2041 of HCV polyprotein) was changed into ACC, coding 
for threonine. 

30 Plasmid pHCVNeol7.m2 is identical to pHCVNeol7, except that the 

triplet TCC (nt. 5242-5244 of pHCVNeol7 sequence) coding for the serine of NS5A 
protein (amino acid 2173 of HCV polyprotein) was changed into TTC, coding for 
phenylalanine. 
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Plasmid pHCVNeol7.m3 is identical to pHCVNeol7, except that the 
triplet TCC (nt. 5314-5316 of pHCVNeol7 sequence) coding for the serine of NS5A 
protein (amino acid 2197 of HCV polyprotein) was changed into TTC, coding for 
phenylalanine. 

5 Plasmid pHCVNeol7.m4 is identical to pHCVNeol7, except that the 

triplet TTG (nt. 5317-5319 of pHCVNeol7 sequence) coding for the leucine of NS5A 
protein (amino acid 2198 of HCV polyprotein) was changed into TCG, coding for 
serine. 

Plasmid pHCVNeol7.m5 is identical to pHCVNeol7, except that an 

10 extra triplet AAA coding for lysine was inserted after the triplet GTG (nt. 4840-4843 
of pHCVNeol7 sequence), coding for valine 2039 of HCV polyprotein. 

•Plasmid pHCVNeol7.m6 is identical to pHCVNeol7, except that the 
triplets GAA and GCC (nt. 2329-2331 and 2764-2766 of pHCVNeol7 sequence) 
coding for the glutamic acid and the alanine of NS3 protein (amino acid 1202 and 

15 1347 of HCV polyprotein) were changed respectively into GGA and ACC, coding for 
glycine and threonine. The triplet TCC (nt. 5242-5244 of pHCVNeol7 sequence) 
coding for the serine of NS5A protein (amino acid 2173 of HCV polyprotein) was 
changed into TTC, coding for phenylalanine; an extra adenosine was inserted into the 
EMCV IRES (after the thymidine 1736 of the replicon sequence). 

20 Plasmid pHCVNeol7.m7 is identical to pHCVNeol7, except that the 

triplet A AC (nt. 4846-4848 of pHCVNeol7 sequence) coding for the asparagine of 
NS5A protein (amino acid 2041 of HCV polyprotein) was changed into ACC, coding 
for threonine; the triplet TCC (nt. 5242-5244 of pHCVNeo!7 sequence) coding for 
the serine of NS5A protein (amino acid 2173 of HCV polyprotein) was changed into 

25 TTC, coding for phenylalanine. 

Plasmid pHCVNeol7.m8 is identical to pHCVNeol7, except that the 
triplet AAC (nt. 4846-4848 of pHCVNeol7 sequence) coding for the asparagine of 
NS5A protein (amino acid 2041 of HCV polyprotein) was changed into ACC, coding 
for threonine; the triplet TCC (nt. 5314-5316 of pHCVNeol7 sequence) coding for 

30 the serine of NS5A protein (amino acid 2197 of HCV polyprotein) was changed into 
TTC, coding for phenylalanine. 

Plasmid pHCVNeol7.m9 is identical to pHCVNeol7, except that the 
triplet AAC (nt. 4846-4848 of pHCVNeol7 sequence) coding for the asparagine of 
NS5A protein (amino acid 2041 of HCV polyprotein) was changed into ACC, coding 
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for threonine; the triplet TTG (nt. 5317-5319 of pHCVNeoH sequence) coding for 
the leucine of NS5A protein (amino acid 2198 of HCV polyprotein) was changed into 
TCG, coding for serine. 

Plasmid pHCVNeol7.mlO is identical to pHCVNeoH, except that the 
5 triplet GAA (nt. 2329-233 1 of pHCVNeoH sequence) coding for the glutamic acid of 
NS3 protein (amino acid 1202 of HCV polyprotein) was changed into GGA, coding 
for glycine; an extra triplet AAA coding for lysine was inserted after the triplet GTG 
(nt. 4840-4843 of pHCVNeoH sequence), coding for valine 2039 of HCV 
polyprotein. 

10 Plasmid pHCVNepl7.ml 1 is identical to pHCVNeoH, except that the 

triplet TCC (nt. 5314-5316 of pHCVNeoH sequence) coding for the serine of NS5A 
protein (amino acid 2197 of HCV polyprotein) was changed into TTC, coding for 
phenylalanine. The triplet GCC (nt. 5320-5322 of pHCVNeoH sequence) coding for 
the alanine of NS5A protein (amino acid 2199 of HCV polyprotein) was changed into 

1 5 ACC coding for threonine. 

Plasmid pHCVNeol7.ml2 is identical to pHCVNeoH, except that the 
triplet AAC (nt. 4846-4848 of pHCVNeoH sequence) coding for the asparagine of 
NS5A protein (amino acid 2041 of HCV polyprotein) was changed into ACC, coding 
for threonine; the triplet TCC (nt. 5314-5316 of pHCVNeoH sequence) coding for 

20 the serine of NS5A protein (amino acid 2197 of HCV polyprotein) was changed into 
TTC, coding for phenylalanine. The triplet GCC (nt. 5320-5322 of pHCVNeoH 
sequence) coding for the alanine of NS5A protein (amino acid 2199 of HCV 
polyprotein) was changed into ACC coding for threonine. 

Plasmid pHCVNeol7.ml3 has the same mutations as 

25 pHCVNeol7.m8, but also an extra adenosine inserted into the EMCV IRES (after the 
thymidine 1736 of the replicon sequence). 

Plasmid pHCVNeol7.ml4 has the same mutations as 
pHCVNeol7.ml 1, but also an extra adenosine inserted into the EMCV IRES (after 
the thymidine 1736 of the replicon sequence). 

30 Plasmid pHCVNeol7.ml5 is identical to pHCVNeo!7, except that the 

triplet GCC (nt. 5320-5322 of pHCVNeoH sequence) coding for the alanine of 
NS5A protein (amino acid 2199 of HCV polyprotein) was changed into ACC coding 
for threonine. 
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Plasmid pRBSEAP.5 is a pHCVNeo.17 derivative where the Neo 
coding sequence has been replaced with the sequence coding for the human placental 
alkaline phosphatase corresponding to nucleotides 90-1580 of pBC12/RSV/SEAP 
plasmid. (Berger, etaU 1988. Gem? 66, 1-10.) 

5 

RNA Transfection 

Transfection was performed using Huh-7 cells. The cells were grown 
in Dulbecco's modified minimal essential medium (DMEM, Gibco, BRL) 
supplemented with 10% FCS. For routine work, cells were passed 1 to 5 twice a 

10 week using lx trypsin/EDTA (Gibco, BRL). 

Plasmids were digested with the Seal endonuclease (New England 
Biolabs) and transcribed in vitro with the T7 Megascript kit (Ambion). Transcription 
mixtures were treated with DNase I (0.1 U/ml) for 30 minutes at 37°C to completely 
remove template DNA, extracted according to the procedure of Chomczynski 

15 (Chomczynski, et ai, 1987. Anal. Biochem. 162, 156-159), and resuspended with 
RNase-free phosphate buffered saline (rfPBS, Sambrook, et al f 1989. Molecular 
Cloning: A Laboratory Manual, 2 nd ed. Cold Spring Harbor Laboratory, Cold Spring 
Harbor, N.Y.). 

RNA transfection was performed as described by Liljestrom, et ai, 
20 1991. 7. Virol. 6, 4107-4113, with minor modifications. Subconfluent, actively 

growing cells were detached from the tissue culture container using trypsin/EDTA. 
Trypsin was neutralised by addition of 3 to 10 volumes of DMEM/10%FCS and cells 
were centrifuged for 5 minutes at 1200 rpm in a Haereus table top centrifuge at 4°C. 
Cells were resuspended with ice cold rfPBS by gentle pipetting, counted with a 
25 haemocitometer, and centrifuged as above. rfPBS wash was repeated once and cells 
were resuspended at a concentration of 1-2 x 10 7 cell/ml in rfPBS. Aliquots of cell 
suspension were mixed with RNA in sterile eppendorf tubes. The RNA/cell mixture 
was immediately transferred into the electroporation cuvette (precooled on ice) and 
pulsed twice with a gene pulser apparatus equipped with pulse controller (Biorad). 
30 Depending on the experiment, 0.1, 0.2 or 0.4 cm electrode gap cuvettes were used, 
and settings adjusted (Table 3). 
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TABLE 3 



Cuvette 


Volume 


Voltage 


Capacitance 


Resistance 


RNA 


gap (cm) 


(al) 


(Volts) 


(uFa) 


(ohm) 


<H8> 


0.1 


70 


200 


25 


infinite 


1-10 


0.2 


200 


400 


25 


infinite 


5-20 


0.4 


800 


800 


25 


infinite 


15-100 



After the electric shock, cells were left at room temperature for 1-10 
5 minutes (essentially the time required to electroporate all samples) and subsequently 
diluted with at least 20 volumes of DMEM/10%FCS and plated as required for the 
experiment. Survival and transfection efficiency were monitored by measuring the 
neutral red uptake of cell cultured for various days in the absence or in the presence of 
neomycin sulfate (G418). With these parameters, survival of Huh-7 cells was usually 
10 40-60% and transfection efficiency ranged between 40% and 100%. 

Sequence Analysis of Replicon RNAs 

The entire NS region was recloned from 3 different transfection 
experiments performed with HCVNeo.17 RNA. RNA was extracted from selected 
15 clones either using the Qiagen RNAeasy minikit following manufacturer instructions 
or as described by Chomczynski, et aL, 1987. Anal. Biochem. 762, 156-159. 

Replicon RNAs (5 jxg of total cellular RNA) were retro- transcribed 
using oligonucleotide HCVG34 (5'- ACATG ATCTGC AGAGAGGCCAGT-3' ; SEQ. 
ED. No. 4) and the Superscript II reverse transcriptase (Gibco, BRL) according to 
20 manufacturer instructions, and subsequently digested with 2 U/ml Ribonuclease H 

(Gibco BRL). The cDNA regions spanning from the EMCV IRES to the HCV 3' end 
were amplified by PCR using oligonucleotides HCVG39 (5'- 
GACASGCTGTGATAWATGTCTCCCCC-3'; SEQ. ID. NO. 5) and CITE3 (5'- 
TGGCTCTCCTCAAGCGTATTC -3'; SEQ. ID. NO. 6) and the LA Taq DNA 
25 polymerase (Takara LA Taq). 

Amplified cDNAs were digested with the Kpnl endonuclease (New 
England Biolabs) and the 5.8 kb fragments were gel purified and ligated to the 5.6 kb 
vector fragment (purified from plasmid pRBSEAP.5 digested with Kpnl) using T4 
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DNA ligase (New England Biolabs) according to manufacturer instructions. Ligated 
DNAs were transformed by electroporation in DH10B or JM1 19 strains of E. coli. 

In the case of NS5A region, total RNA isolated from 3 clones, (HB77, 
HB60 and HB68) was extracted and used for RT-PCR. 5fig of total RNA plus 20 
5 pmole of AS61 oligo (5 ' - ACTCTCTGC A GTC AAGCGGCTC A-3 ' , RT antisense 
oligo; SEQ. ID. NO. 7) were heated 5 minutes at 95°C, then DMSO (5% f.c), DTT 
(10 mM f.c), 1 mM dNTP (1 mM f.c), lx Superscript buffer (1 x f.c), and 10 u 
Superscript (Gibco) were added to a total volume of 20 fi\ and incubated 3 hours at 
42°C. 2/il of this RT reaction were used to perform PCR with oligos S39 (5'- 

10 C AGTGG ATG A ACCGGCTG AT A-3 ' , sense; SEQ. ID. NO. 8) or S41 (5'- 

GGGGCG ACGGC ATC ATGC A A ACC-3 ' , sense; SEQ. ID. NO. 9) and B43 (5'- 
C AGG ACCTGC AGTCTGTC A A AGG-3 ' , antisense; SEQ. ID. NO. 10) using 
Elongase Enzyme Mix (Gibco) according the instruction provided by the 
manufacturer. The resulting PCR fragment was cloned in pCR2.1 vector using the 

15 TA Cloning kit (Invitrogen) and transformed in ToplOF' bacterial strain. 

Plasmid DNA was prepared from ON culture of the resulting 
ampicillin resistant colonies using Qiagen 500 columns according to manufacturer 
instructions. The presence of the desired DNA insert was ascertained by restriction 
digestion, and the nucleotide sequence of each plasmid was determined by automated 

20 sequencing. Nucleotide sequences and deduced amino acids sequences were aligned 
using the GCG software. 

TaqMan 

TaqMan analysis was typically performed using 10 ng of RNA in a 
25 reaction mix (TaqMan Gold RT-PCR kit, Perkin Elmer Biosystems) either with HCV 
specific oligos/probe (oligo 1: 5 ' -CGGG AG AGCC AT AGTGG-3 ' ; SEQ. DD. NO. 11, 
oligo 2: 5'-AGTACCACAAGGCCTTTCG-3'; SEQ. ID. NO. 12, probe: 5'- 
CTGCGGAACCGGTGAGTACAC-3'; SEQ. ID. NO. 13) or with human GAPDH 
specific oligos/probe (Pre-Developed TaqMan Assay Reagents, Endogenous Control 
30 Human GAPDH, Part Number 43 10884E, Perkin Elmer Biosystems). PCR was 

performed using a Perkin Elmer ABI PRISM 7700 under the following conditions: 30 
minutes at 48°C (the RT step), 10 minutes at 95°C and 40 cycles: 15 seconds at 95°C 
and 1 minute at 60°C. Quantitative calculations were obtained using the Comparative 
C T Method (described in User Bulletin #2, ABI PRISM 7700 Sequence Detection 
35 System, Applied Biosystem, Dec 1997) considering the level of GAPDH mRNA 
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constant. All calculations of HCV RNA are expressed as fold difference over a 
specific control. 

Antibodies and Immunological Techniques 
5 Mouse monoclonal antibody (anti-NS3 mablOE5/24) were produced 

by standard techniques. (Galfr6 and Milstein, 1981. Methods in Enzymology 73, 1- 
46.) Purified recombinant protein was used as an immunogen. (Gallinari, et aL, 
1999. Biochemistry 38, 5620-5632.) 

For Cell-ELISA analysis, transfected cells were monitored for 

10 expression of the NS3 protein by ELJSA with the anti-NS3 mab 10E5/24. Cells were 
seeded into 96 well plates at densities of 40,000, 30,000, 15,000 and 10,000 cells per 
well and fixed with ice-cold isopropanol at 1, 2, 3 and 4 days post-transfection, 
respectively. The cells were washed twice with PBS, blocked with 5% non-fat dry 
milk in PBS + 0.1% Triton X100 + 0.02% SDS (PBSTS) and then incubated 

15 overnight at 4°C with 10E5/24 mab diluted 1 :2000 in Milk/PBSTS. After washing 5 
times with PBSTS, the cells were incubated for 3 hours at room temperature with 
anti-mouse IgG Fc specific alkaline phosphatase conjugated secondary antibody 
(Sigma A-7434), diluted 1:2000 in Milk/PBSTS. After washing again as above, the 
reaction was developed with p-nitrophenyl phosphate disodium substrate (Sigma 104- 

20 105) and the absorbance at 405 nm read at intervals. 

The results were normalized by staining with sulforhodamine B (SRB 
Sigma S 1402) to determine cell numbers. The alkaline phosphatase substrate was 
removed from the wells and the cells washed with PBS. The plates were then 
incubated with 0.4% SRB in 1% acetic acid for 30 minutes (200 (il/well), rinsed 4 

25 times in 1% acetic acid, blotted dry and then 200 |il/well of lOmM Tris pH 10.5 
added. After mixing, the absorbance at 570 nm was read. 

Neutral Red/ Crystal Violet Staining of Foci 

The survival of transfected cells in the absence or presence of G418 

30 was monitored by staining of foci/clones with neutral red in vivo with subsequent 
crystal violet staining. The medium was removed from the cells and replaced with 
fresh medium containing 0.0025% neutral red (Sigma N2889) and the cells incubated 
for 3 hours at 37°C Cells were washed twice with PBS, fixed in 3.5% formaldehyde 
for 15 minutes, washed twice again in PBS and then with distilled water and the 

35 number of (live) foci counted. The cells could then be re-stained with crystal violet 
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by incubating with an 0.1% crystal violet (Sigma C0775) solution in 20% methanol 
for 20 minutes at room temperature, followed by 3 washes in 20% methanol and a 
wash with distilled water. 

5 Preparation Of Cells Cured Of Endogenous Replicon 

Replicon enhanced cells designated 10IFN and C1.60/cu were produced 
using different HCV inhibitory agents. Based on the techniques described herein 
additional replicon enhanced clones can readily be obtained. 

10IFN was obtained by curing a Huh-7 cell of a replicon using human 

10 IFN-oc2b. Huh-7 cells containing HCV replicons (designated HBI10, HBffl4, HBffl27 
and HBIII18) were cultured for 1 1 days in the presence of 100 U/ml recombinant 
human DFN-cc2b (Intron-A, Schering-Plough), and subsequently for 4 days in the 
absence of IFN-a2b. At several time points during this period, the clones were 
analyzed for the presence of HCV proteins and RNA by Western and Northern 

15 blotting. After 7 days of incubation with BFN-a2b, HCV proteins could no longer be 
detected in any of these clones by Western blotting and similar effects were seen with 
RNA signals in Northern blots. IFN-ot2b treated cells were stored in liquid nitrogen 
until used for transfection experiments. 

C1.60/cu was obtained by curing a Huh-7 cell of a replicon using an 

20 HCV inhibitory compound. The presence of HCV RNA was determined using PCR 
(TaqMan) at 4, 9, 12 and 15 days. From day 9 the amount of HCV RNA was below 
the limit of detection. To further test the disappearance of the replicon, 4 million cells 
of cured Clone 60 cells (after the 15 days of treatment) were plated in the presence of 
1 mg/ml G-418. No viable cells were observed, confirming that absence of HCV 

25 replicons able to confer G-418 resistance. 

Example 2: Selection and Characterization of Cell Clones Containing Functional 
HCV Replicons 

Huh-7 cells (2-8xl0 6 ) were transfected by electroporation with in vitro 
30 transcribed replicon RNAs (10-20 ng), plated at a density ranging from 2.5xl0 3 to 
10xl0 3 /cm 2 , and cultured in the presence of 0.8-1 mg/ml G418. The majority of 
replicon transfected cells became transiently resistant to G418 and duplicated 
normally for 7 to 12 days in the presence of the drug, while cells transfected with 
irrelevant RNAs and mock transfected cells did not survive more than 7 days (data not 
35 shown). Transient resistance to G418 was likely due to persistence of the Neo protein 
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expressed from the transfected RNA, since it was observed also with mutated 
replicons unable to replicate. Approximately 2 weeks after transfection, transient 
resistance declined, most cells died and small colonies of cells permanently resistant 
to the antibiotic became visible in samples transfected with HCVNeo.17 RNA, but 
5 not in cells transfected with other replicon RNAs. 

In several experiments, the frequency of G418 resistant clones ranged 
between 10 and 100 clones per 10 6 transfected cells. About 20 G418 resistant 
colonies were isolated, expanded and molecularly characterized. PCR and RT-PCR 
analysis of nucleic acids indicated that all clones contained HCV RNA but not HCV 

10 DNA, demonstrating that G418 resistance was due to the presence of functional 

replicons (data not shown). This result was confirmed by Northern blot analysis and 
metabolic labeling with 3H-uridine, which revealed the presence of both genomic and 
antigenomic HCV RNAs of the expected size (data not shown). Lastly, western blot, 
immunoprecipitation and immunofluorescence experiments showed that these clones 

15 expressed all HCV non-structural proteins as well as Neo protein (data not shown). 

Clones differed in terms of cell morphology and growth rate. Replicon 
RNA copy number (500-10000 molecules/cell) and viral protein expression also 
varied between different clones (data not shown). However, the amount of replicon 
RNA and proteins also varied with passages and with culture conditions and was 

20 higher when cells were not allowed to reach confluency, suggesting that replicons 
replicated more efficiently in dividing cells than in resting cells. Processing of the 
viral polyprotein occurred with kinetics similar to those observed in transfected cells. 

Interestingly, in all tested clones HCV replication was efficiently 
inhibited by treating the cells with EFN-cc2b. The EC50 was between 1 and 10 U/ml, 

25 depending on the clone. 



Example 3: Identification of Adaptive Mutations 

The low number of G418 resistant clones derived from HCVNeo.17 
RNA transfection suggested that replication could require mutation(s) capable of 
30 adapting the replicon to the host cell (adaptive mutations) and/or that only a small 
percentage of Huh-7 cells were competent for HCV replication. To verify the first 
hypothesis, mutations in replicons RNAs derived from selected cell clones were 
identified. 
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RNA sequences for different replicons were determined using standard 
techniques. Such techniques involved isolating RNA from several independent 
clones, reverse transcription to produce cDNA, amplifying cDNAs by PCR and 
cloning into an appropriate vector. The cDNA spanning almost the entire HCV NS 
5 region (126 bp at the 3' end of the EMCV IRES and 5650 bp of the HCV NS region 
(i.e., the entire NS ORF and 298 nucleotides at the 3' end) from 5 clones (HBI10, 
HBID12, HBIII18, HBffl27, HBIV1) were recloned and sequenced. In addition, the 
NS5A coding region (nt. 4784-6162) from 3 additional clones (HB 77, HB 68 and HB 
60) were recloned and sequenced. 
10 To discriminate mutations present in the replicon RNA from those 

derived from the cloning procedure, at least 2 isolates derived from independent RT- 
PCR experiments were sequenced for each cell clone. Comparison of the nucleotide 
sequences with the parental sequence indicated that each isolate contained several 
mutations (Tables 4A and 4B). 

15 

TABLE 4A 



Cell clone 


HBIII 12 


HBIII 18 


HBI 10 


HBIII 27 


isolate 


4 


29 


28 


61 


12 


43 


13 


72 




1674- 
7460 


1674- 
7460 


1674- 
7460 


1674- 
7460 


1674- 
7460 


1674- 
7460 


1674-7460 


1674-7460 


EMCV 
IRES 
126 bp 


A @ 1736 


A @ 1736 




C 1752 T 








T 1678 C 


NS3 
1895 bp 


G 2009 C 

A 2698 G 
G 2764 A 

A 3256 G 
T3273C 


A 2330 G 

C2505T 
G 2764 A 

T 3085 C 


T2150C 
C 2196 A 
T 3023 A 
T3134C 
C3267T 


T 2015 C 

A 2338 G 
C 2616 T 
A 2664 G 
A 3148 G 
T 3286 C 
C3615T 
C3657T 


T 1811 A 
A 2330 G 

T 2666 C 
T 3395 C 


A 2330 G 

A 2882 G 
T 3673 C 


G 2009 C 
T2015C 

C 2336G 
A3130T 
A 3401 G 
A3518C 


G 2009 C 

C 2052 A 
G 2644 A 
C 2803 A 
T 2823 A 
T 3692 C 


NS4A 
161 bp 


T 3790 C 




A 3847 G 


T 3827 A 


T 3742 C 




A 3743 G 


A 3797 G 


NS4B 
782 bp 


T 3869 C 
A 4107 G 
T4185C 
A 4428 G 


C 4283 T 
C4429T 


G 4300 A 


A4136G 
A 4261 G 
G 4309 A 
A 4449 G 


T4290C 


A 4053 G 
A 2496 C 
T4316G 


G 3880 A 
T4200C 
A 4366 G 


C 4547 T 
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TABLE 4A 



Cell clone 


HBIII 12 


HBIII 18 


HBI 10 


HBIII 27 


isolate 


4 


29 


28 


61 


12 


43 


13 


72 




1674- 


1674- 


1674- 


1674- 


1674- 


1674- 


1674-7460 


1674-7460 




7460 


7460 


7460 


7460 


7460 


7460 






i NS5A 


A 4847 C 


G 4728 A 


C 5243 T 


C 4729 A 


A 4694 T 


A 4675 G 


A 4855 G 


A 4888 G 


1340 bp 


G 5158 A 


A 4845 G 


A 5486 G 


T4993C 


AAA @ 
4842 


A 4761 G 


C5006T 


C 4985 T 




G5J75C 


C 5243 T 


C 5596 T 


G5095 A 


T 5237 C 


AAA @ 
4842 


T 5318 C 


T 5030 A 




C 5243 T 


G5512T 


G 5823 A 


T5334 C 




T5368C 


A 5574 G 


T 5090 A 




C5390T 


A 5521 G 




A 5374 T 






G 5866 A 


T5318C 




A 5719 G 


A 5600 G 
A 5740 C 




T 5379 A 
T5480C 
A5513G 
T 5977 C 








A 5328 G 
A 5399 G 
A 5574 G 


NS5B 


T6316C 


A 6406 G 


T 6074 C 


A 6150 G 


A 6911 G 


A 5986 G 


G6479C 


G 6156 A 


1477 bp 


T 6589 C 


G 6756 A 


A 6541 G 


A 6218 G 




T6099C 


C6870T 


G 7434 A 




T 7370 C 


G6963T 


A 6732 G 
A 7350 T 
A 7359 G 


T 7352 A 




C6141 T 
G 6463 A 
C6849T 
T 6865 C 


A7213G 
T 7448 C 


T 7444 C 



Clone name and isolate number are indicated in the first and second row, respectively. 
The first and the last nucleotide of the region that was recloned and sequenced are indicated in the third 
5 row. 

Nucleotide (IUB code) substitutions are indicated with the original nucleotide, its position and mutated 
nucleotide. 

Nucleotide(s) insertions are indicated with the nucleolide(s), the symbol @ and the position of the 

nucleotide preceding insertion. 
1 0 Numbering refers to the first nucleotide of the replicon sequence (EMBL-genbank No. AJ242652). 

The region in which mutations are located and the nucleotide length of each region are indicated in the 

left most column. 

Silent mutations are in italic. 

Non sense mutations are underlined. 
1 5 Consensus mutations are bold. 



TABLE 4B 



Cell clone 


HBIV1 


HB 77 


HB 68 


HB 60 


isolate 


85 


93 


10 


14 


42 


I 


13 


7 




1674- 
7460 


1674- 
7460 


4784- 
6162 


4465- 
6162 


4784- 
6162 


4465- 
6162 


4784-6162 


4784-6162 


EMCV 
IRES 
126 bp 




A @ 1736 














NS3 
1895 bp 


A 3403 G 


A 2572 G 
A 3454 G 
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TABLE 4B 



Cell clone 


HBIV1 


HB 77 


HB 68 


HB60 


isolate 


85 


93 


10 


14 


42 


1 


13 


1 




1674- 
7460 


1674- 
7460 


4784- 
6162 


4465- 
6162 


4784- 
6162 


4465- 
6162 


4784-6162 


4784-6162 


NS4A 
161 bp 


















NS4B 
782 bp 


A 4084 G 


C 3892 T 
















NS5A 
1340 bp 


T 4742 C 
C 5315 T 

U 5431 T 


A 4847 C 

A 5225 G 
C5315 r 
G 5320 A 
T 5356 A 
G 5523 A 
T 5888 A 


C4813T 
G5060C 

/~t can a 

C 5337 A 


A 4699 C 
A5161 G 
C 5337 A 
A 5459 G 
T 5977 C 


T5171 G 
C5298T 
C 5337 A 
A 5639 G 
A 5969 G 


T4587C 
T 4972 C 
A 5094 G 
A 5278 G 
G 5320 A 
C 55327 


A 4821 G 
G 5320 A 
A 5414 G 
T 5601 G 
C5808T 


C 5337 G 
C5551 T 
G 5806 A 


T575J C 
T 5797 C 


NS5B 
1477 bp 


T 6144 A 

A 6365 G 
A 6656 G 
A 6677 G 
T6855C 
T 6947 A 
T 6997 C 
G704I T 
A7187C 


T6855C 
A 7135 G 
T7I7I C 















See Table 4A legend. 

The frequency of mutations ranged between 1 .7 x 10* 3 and 4.5 x 1 0 3 
5 (average 3 x 10°). The majority of mutations were nucleotide substitutions, although 
insertions of 1 or more nucleotides were also observed (Tables 4A and 4B). 

Approximately 85% of the mutations found only in 1 isolate (non- 
consensus) were randomly distributed in the recloned fragment, and possibly include 
mis-incorporation during the PCR amplifications. Conversely, the remaining 15% of 
10 the mutations were common to 2 or more isolates derived from independent RT-PCR 
experiments (consensus mutations), and presumably reflected mutations present in the 
template RNA. 

Consensus mutations were found in all isolates and were either 
common to isolates derived from the same clone (consensus A), or to isolates derived 
15 from different clones (consensus B). Analysis of additional isolates derived from the 
same cell clones indicated that consensus A mutations were not always present in all 
isolates derived from one clone (data not shown). This observation, together with the 
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presence of consensus B mutations, suggests that, even within a single cell clone, 
replicons exist as quasi-species of molecules with different sequences. 

At variance with non-consensus mutations, consensus mutations were 
not randomly distributed but were clustered in the regions coding for the NS5A 
5 protein (frequency 1 x 10* 3 ) and for the NS3 protein (frequency 0.5 x 10' 3 ). Only one 
consensus mutation was found in the region coding for the NS5B protein (frequency 
0.1 x 10* 3 nucleotides) and none in the regions coding for NS4A and NS4B. 
Interestingly, 1 consensus mutation was observed also in the EMCV IRES. 

With the exception of 2 silent mutations found in NS5A and NS5B, 

10 consensus mutations occurring in the NS region resulted in changes in the deduced 
amino acid sequence (Tables 5A and 5B). Noticeably, these amino acid changes 
occurred in residues that are conserved in all or most natural HCV isolates. 
Interestingly, clones HB 77 and HB 60 displayed different nucleotide substitutions 
(C5337A and C5337G, respectively) resulting in the same amino acidic mutation (S 

15 2204 R). 



TABLE 5A 



Cell clone 


HBI1I 12 


HB1I1 18 


HB1 10 


HBIII27 


isolate 


4 


29 


28 


61 


12 


43 


13 


72 


NS3 


G 1095 A 
A 1347 T 


E 1202 G 
A 1347 T 






E 1202 G 


E 1202 G 


G 1095 A 


G 1095 A 


NS4A 


















NS4B 


















NS5A 


N2041 T 
S2I73F 


S2173F 


S2173F 


E2263 


K @ 2039 


K @ 2039 


L2198S 
R 2283 R 


L2198S 
R 2283 R 


NS5B 



















See Table 4A legend. 
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TABLE 5B 



Cell clone 


HBIV1 


HE 11 


HB 68 


HB60 


isolate 


85 


93 


1 10 


14 


42 


1 


13 


! 7 


NS3 


















NS4A 


















NS4B 


















NS5A 


S2197F 


N2041 T 

S2197F 
A 2199 T 


S2204 
R 


S2204 
R 


S2204R 


A 2199 T 


A 2199 T 


S2204R 


NS5B 


N2710N 


N2710N 















See Table 4A legend. 
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Example 4: Functional Characterization of Consensus Mutations 

The identification of consensus mutations in recloned replicons 
indicated that replication proficiency of replicon RNAs contained in selected cell 
clones depended from the presence of such mutations. To substantiate this 
hypothesis, the effect of several consensus mutations on replication were analyzed. 

Consensus mutations found in the NS5A region were more closely 
analyzed. Consensus mutations were segregated from the non-consensus ones, and 
pHCVNeo.17 derivatives containing single or multiple consensus mutations were 
constructed (Table 6). 



TABLE 6 



Construct 



Consensus mutations 



G418cfu/10 :> 
trans fee ted 
cells 



P HCVNeol7.wt 
pHCVNeol7.GAA 
pHCVNeol7.m0 
P HCVNeol7.ml 
P HCVNcol7.m2 
pHCVNeol7.m3 
P HCVNeol7.m4 



NS3 



NS5A 



S2204R 
N204IT 
S2173F 
S2197F 
L2198S 



EMCV IRES 



0-3 
0 

30-130 
0-3 

15-60 
160-500 

30-50 
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TABLE 6 



("^ruictriirt 


Consensus mutations 


CIA 1 Q ^f. . / 1 fv> 

U4 1 o ct u/ 1 U 
cells 




NS3 


NS5A 


EMCV IRES 




pHCVNeol7.m5 




K@2039 




25-55 


pHCVNeol7.m6 


E1202G;A1347T 


S2173F 


Extra A 


13-100 


P HCVNeol7.m7 




N2041T;S2173F 




0-1 


P HCVNeol7.m8 




N2041T;S2197F 




360-500 


pHCVNeol7.m9 




N2041T;L2198S 




140-170 


pHCVNeol7.mlO 


E1202G 


K@2039 




1060 


pHCVNeol7.mll 




S2197F; A2199T 




900 


pHCVNeol7.ml2 




N2041T; S2197F; A2199T 




>1000 


P HCVNeol7.ml3 




N2041T;S2197F 


Extra A 


100 


pHCVNeol7.ini 4 




S2197F;A2199T 


Extra A 


>500 


pHCVNeol7.ml5 

IL L -7 /~ 1rt6v 




A2199T 




300-600 



Huh-7 cells (2x10 ) were transfected with 10 \ig of RNA transcribed from the indicated constructs. 
Approximately 2xl0 5 cells were plated in a 10 cm tissue culture dish and cultured with 1 mg/ml G418 
for 20 days. 

Colonies surviving selection were stained with crystal violet and counted. 



RNAs transcribed in vitro from these constructs were transfected in 
Huh-7 cells and the affect on replication was estimated by counting neomycin 
resistant colonies (G418 cfu). As shown in Table 6, all but 1 construct containing 
single consensus mutations showed a significant increase on G418 cfu efficiency, thus 
10 indicating that the corresponding mutations improved replication. Noticeably, 2 
mutants containing single mutations in NS5A (m3 and ml5) were clearly more 
effective than all other single mutants.. Results of mutants containing 2 or more 
mutations, indicated the presence of a synergistic effect in some combinations (m8, 
m9, mil and possibly mlO), but also a slightly antagonistic effect in 1 mutant (m7). 

15 

Example 5: Replicon Replication in the Absence of Selection 

Replication of HCV replicons in the absence of a G418 selection was 
detected using quantitative PCR (TaqMan). At 24 hours post-transfection a large 
amount of replicon RNA was detected in cells transfected with all replicons, including 
20 the GAA control replicon containing mutations in the catalytic GDD motif of the 

NS5B polymerase. This result suggested that analysis at very early time points (up to 
48 hour post-transfection) essentially measured the input RNA. Northern blot 
analysis also indicated that after 24 hours the majority of the transfected RNA was 
degraded intracellular^ (data not shown). 
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Analysis at later time points showed that the amount of replicon RNA 
was considerably reduced at 4 days and eventually became undetectable (6/8 days) in 
cells transfected with replicon HCVNeol7.wt, but was still high in cells transfected 
with replicons mO, m3 and ml5 (Table 7). At day six, that the amount of replicon 
5 RNA became undetectable in cells transfected with replicon HCVNeol7.wt, mO, and 
m2, but was detectable in cells transfected with replicon m3 and ml5 (Table 7). 



TABLE 7 



Name 


Hu H7 


RNA equ. 


RNA equ. 




day 4 


day 6 


Wt 


1 X 


1 X 


hcvneol7.m0 


3x 


1 X 


hcvneol7.m2 


1 X 


1 X 


hcvneol7.m3 


5x 


3x 


hcvneol7.ml5 


6x 


5x 



10 

Persistence of mO, m3 and m!5 replicons RNA was abolished by 
treatment with interferon-a or with an HCV inhibitory compound (data not shown). 
Moreover, RNA persistence was not observed with mutated replicons carrying the 
NS5B GAA mutation besides adaptive mutations (data not shown). Taken together, 

15 these results demonstrated that quantitative PCR could be used to monitor replication 
at early times post-transfection, and can be used to evaluate the replication proficiency 
of replicon RNAs containing mutations. 

Comparison of the results shown in Tables 6 and 7, indicated that there 
was a good correlation between the amount of replicon RNA detected by TaqMan and 

20 the G418 cfu efficiency. Nonetheless, some mutants (m2, m3) showed a pronounced 
effect on G418 cfu efficiency, and little if any effect on early replication as measured 
by TaqMan PCR, while other mutants (mO) showed the reverse behavior. 



30 



WO 02/059321 



PCT/EP02/00526 



Example 6: HCV Replicon Enhanced Cells 

HCV replicon enhanced cells were produced by introducing an HCV 
replicon into a host, then curing the host of the replicon. Adaptive mutations (or 
combinations of them) by themselves increased up to 2 orders of magnitude the G418 
5 cfu efficiency and enhanced early replication comparably. Nonetheless, even with the 
most effective mutants, only a small percentage of transfected cells (<5 %, data not 
shown) gave rise to G418 resistant clones containing functional replicons. This 
observation was attributed, at least in part to a low cloning efficiency of Huh-7 cells 
(data not shown), and only a fraction of Huh-7 cells being competent for replication. 
10 Several clones were cured of endogenous replicons by treating them 

for about 2 weeks with IFN-ot or with a HCV inhibitory compound. Analysis at the 
end of the treatment showed that neither viral proteins nor replicon RNA could be 
detected. 

Cured cells (10IFN and C1.60/cu) were transfected with mutated 
15 replicons and replication efficiency was determined by counting neomycin resistant 
clones (10IFN) or by TaqMan (10EFN and C1.60/cu). As shown in Table 8, for all 
tested replicons the G418 cfu efficiency in 101FN cells was at least 5 fold higher than 
in parental Huh-7 cells. This increase in G418 cfu efficiency was particularly relevant 
for a subset of mutants (m3, m5, m8, m9, m!5). 

20 

TABLE 8 



Construct 


Consensus mutations 


G418CIU/10* 
transfected cells 




NS3 


NS5A 


EMCV IRES 




pHCVNeol7.wt 








12-56 


pHCVNeol7.GAA 








0 


P HCVNeol7.m0 




S2204R 




180- 1000 


pHCVNeol7.ml 




N2041T 




8-13 


pHCVNeol7.m2 




S2173F 




2000 


pHCVNeol7.m3 




S2197F 




1600-3000 1 


P HCVNeol7.m4 




L2198S 




190-650 


pHCVNeol7.m5 




K@2039 




1600-3000 


pHCVNeol7.m6 


E1202G; A1347T 


S2173F 


extra A 


600 - 2000 


pHCVNeo!7.m7 




N204!T;S2173F 




170- 800 


pHCVNeol7.m8 




N204IT; S2197F 




>4000 


pHCVNeol7.m9 




N204IT; L2198S 




1400- 3000 


pHCVNcol7.m!0 


E1202C 


K@2039 




>4000 


pHCVNeol7.mll 




S2197F; A2I99T 




>4000 
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TABLE 8 



Construct 


Consensus mutations 


G418cfu/10 5 
transfected cells 




NS3 


NS5A 


EMCV IRES 




pHCVNeol7.ml2 




N2041T; S2197F; A2199T 




>4000 


pHCVNeoi7.ml3 




N2041T; S2197F 


extra A 


>4000 


P HCVNeol7.ml4 




S2197F;A2199T 


extra A 


>4000 


P HCVNeol7.ml5 




A2199T 




>4000 



10IFN cells (2x10 ) were transfected with 10 ug of RNA transcribed from the indicated constructs. 
Approximately 2xl0 5 cells were plated in a 10 cm tissue culture dish and cultured with 1 mg/ml G418 
for 20 days. 

Colonies surviving selection were stained with crystal violet and counted. 



Strikingly, the best mutants yielded a number of G418 resistant clones 
ranging between 20 and 80% of the cell clones which grew in the absence of G418 
10 (data not shown), thus indicating that the majority of 10IFN cells were competent for 
replication. This result was confirmed by TaqMan analysis (Table 9), in which the 
fold increase versus the parental Huh~7 cells was very high. The data indicates that 
replicons carrying adaptive mutations replicate vigorously in replicon enhanced cells 
such as 10IFN and Cl.60/cu. 

15 

TABLE 9 



Name 


ior 


FN 


C1.60/CU. | 


RNA equ. 


RNA equ. 


RNA equ. 


RNA equ. 




Day 4 


day 6 


day 4 


Day 6 


Wt 


1 X 


1 X 


1 X 


1 X 


hcvneol7.m0 


46 x 


12x 


78 x 


512x 


hcvneol7.m2 


2x 


2x 


1 x 


2x 


hcvneol7.m3 


68 x 


49 x 


19 x 


392 x 


hcvneol7.ml5 


247 x 


80 x 


268 x 


5518 x 



Expression of viral proteins was determined in replicon enhanced cells 
20 using an ELISA assay designed to detect the NS3 protein in transfected cells plated in 
96 wells microtiter plates (Cell-ELISA). As shown in Table 10, 24 hours post- 
transfection cells transfected with all tested replicons expressed low but detectable 
levels of the NS3 protein. 
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TABLE 10 





NS3 arbitrary units 




24 hp 


).t. 


96 r 


p.t. 


Name 




+ IFN 




+IFN 


Construct 










Mock 


1 


1 


1 


1 


pHCVNeoH.wt 


3.7 


4.2 


1.2 


1.3 


pHCVNeoH.GAA 


3.1 


3.2 


1.1 


1 


pHCVNeol7.mO 


3.4 


3.2 


9.9 


0.8 


pHCVNeol7.m3 


5.7 


4.6 


4.7 


1.5 


pHCVNeol7.m8 


6.6 


5.1 


15.1 


1.4 


pHCVNeol7.mlO 


8 


5.6 


9.2 


1.8 


pHCVNeol7.mll 


8.4 


6.2 


13.6 


1.8 



10IFN cells (2xI0 6 ) were transfected with 1 0 ug of RNA transcribed from the indicated constructs. 
Cells were plated in 96 wells microtiter plates as indicated in Example 1. 



5 Where indicated (+IFN), IFN-a ( J 00 U/ml) was added to the culture medium 4 hours post-transfection. 
At the indicated times post-transfection, cells were fixed and analyzed by Cell-ELISA. 

The early expression shown in Table 10 is likely due to translation of 

transfected RNA, since it was comparable in all replicons (including that carrying the 

GAA mutation) and was not affected by IFN-a. At 4 days post-transfection, NS3 

10 expression persisted or increased in cells transfected with replicons carrying 

consensus mutations, but could not be detected anymore in cells transfected with wt 
and GAA replicons. In addition, NS3 expression was almost completely abolished 
when cells were cultured in the presence of IFN-a. 

Taken together, these results indicated that the level of NS3 expression 

15 reflected the replication rate. Indeed, NS3 expression level (Table 10) paralleled the 
RNA level measured by TaqMan (Table 9). The high replication proficiency of 
10EFN cells was further confirmed by immunofluorescence experiments which 
showed that more than 50% of cells transfected with replicons m8 and ml 1 expressed 
high level of viral proteins, and that expression was almost completely abolished by 

20 IFN-a. 

Example 7: Replication of Full Length Constructs 

This example illustrates the ability of a full length HCV genome 
containing adaptive mutations described herein to replicate in a replicon enhanced 
25 host cell. The full length sequence of the HCV isolate Con-1 (EMBL-Genbank No. 
AJ238799) (plasmid pHCVRBFL.wt) and 2 derivatives containing either the N2041T 
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and S2173 F mutations (plasmid pHCVRBFL.m8) or the S2197F and A2199T 
mutations (plasmid pHCVRBFL.ml 1) were used as starting constructs. 

RNAs transcribed from the starting constructs were transfected in 
10DFN cells and their replication proficiency was assessed by Cell-ELISA, 
5 immunofluorescence and TaqMan. Both constructs containing consensus mutations 
(pHCVRBFL.m8 and pHCVRBFL.ml 1) replicated, while no sign of replication was 
observed with the wt. construct (data not shown). 

Example 8: Replicons with Reporter Gene 

10 This example illustrates an HCV replicon containing adaptive 

mutations and a reporter gene. A pHCVNeol7.wt derivative where the Neo coding 
region was substituted with that coding for human placental secretory alkaline 
phosphatase (pRBSEAPS.wt) and a derivative also containing the N2041T and 
S2173F mutations (plasmid pRBSEAP5.m8) were constructed. RNAs transcribed 

15 from these plasmids were transfected in 10IFN cells and their replication proficiency 
was assessed by measuring secretion of alkaline phosphatase. Analysis of the kinetics 
of secretion suggested that only plasmid pRBSEAP5.m8 was competent for 
replication (data not shown). 

20 Example 9: SEP. ID. Nos. 1 and 2 

SEQ. ED. NOs. 1 and 2 are provided as follows: 

SEP. ID. NO. 1 

MSTNPKPQRKTKRNTNRRPQDVKFPGGGQIVGGVYLLPRRGPRLGVRATRKT 
25 SERSQPRGRRQPIPKARQPEGRAWAQPGYPWPLYGNEGLGWAGWLLSPRGS 
RPSWGPTDPRRRSRNLGKVIDTLTCGFADLMGYIPLVGAPLGGAARALAHGV 
RVLEDGVNYATGNLPGCSFSIFLLALLSCLTIPASAYEVRNVSGVYHVTNDCS 
NASIVYEAADMMHTPGCVPCVRENNSSRCWVALTPTLAARNASVPTTTIRR 
HVDLLVGAAALCSAMYVGDLCGSVFLVAQLFTFSPRRHETVQDCNCS1YPGH 
30 VTGHRMAWDMMMNWSPTAALVVSQLLRIPQAVVDMVAGAHWGVLAGLA 
YYSMVGNWAKVLIVMLLFAGVDGGTYVTGGTMAKNTLGITSLFSPGSSQKIQ 
LVNTNGSWHINRTALNCNDSLNTGFLAALFYVHKFNSSGCPERMASCSPIDAF 
AQGWGP1TYNESHSSDQRPYCWHYAPRPCGIVPAAQVCGPVYCFTPSPVVVG 
TTDRFGVPTYSWGENETDVLLLNNTRPPQGNWFGCTWMNSTGFTKTCGGPP 
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CNIGGIGNKTLTCPTDCFRKHPEATYTKCGSGPWLTPRCLVHYPYRLWHYPC 
TVNFTIFKVRMYVGGVEHRLEAACNWTRGERCNLEDRDRSELSPLLLSTTEW 
QVLPCSFTTLPAl^TGLIHLHQNVVDVQYLYGIGSAVVSFAIKWEYVLLLFLLL 
ADARVCACLWMMLLIAQAEAALENLVVLNAASVAGAHGILSFLVFFCAAWY 
5 IKGRLVPGAAYALYGVWPLLLLLLALPPRAYAMDREMAASCGGAVFVGLILL 
TLSPHYKLFLARLIWWLQYFITRAEAHLQVW1PPLNVRGGRDAVILLTCAIHPE 
LIFTITKIliAILGPmVLQAGrrKVPYFVRAHGLIRACMLVRKVAGGHYVQM 
ALMKLAALTGTYVYDHLTPLRDWAHAGLRDLAVAVEPVVFSDMETKVITW 
GADTAACGDnLGLPVSARRGREIHLGPADSLEGQGWRLLAPITAYSQQTRGL 

10 LGCnTSLTGRDRNQVEGEVQVVSTATQSFLATCVNGVCWTVYHGAGSKTLA 
GPKGPITQMYTNVDQDLVGWQAPPGARSLTPCTCGSSDLYLVTRHADVIPVR 
RRGDSRGSLLSPRPVSYLKGSSGGPLLCPSGHAVGIFRAAVCTRGVAKAVDFV 
PVESMETTMRSPVFTDNSSPPAVPQTFQVAHLHAPTGSGKSTKVPAAYAAQG 
YKVLVLNPSVAATLGFGAYMSKAHGBDPNIRTGVRTITTGAPITYSTYGKFLA 

1 5 DGGCSGG AYDIflCDECHSTDSTTILGIGTVLDQAETAGARLVVLATATPPGS V 
TVPHPNDEEVALSSTGEIPFYGKAIPIETIKGGRHLIFCHSKKKCDELAAKLSGLG 
LNAVAYYRGLDVSVIPTSGDVrVVATDALMTGFTGDFDSVIDCNTCVTQTVD 
FSU5PTFTJETTTVPQDAVSRSQRRGRTGRGRMG1YRFVTPGERPSGMFDSSVL 
CECYDAGCAWYELTPAETSVRLRAYLNTPGLPVCQDHLEFWESVFTGLTHID 

20 AHFLSQTKQAGDNFPYLVAYQATVCARAQAPPPSWDQMWKCLIRLKPTLHG 
PTPLLYRLGAVQNEVTTTHPITKYIMACMSADLEVVTSTWVLVGGVLAALAA 
YCLTTGSVVIVGRHLSGKPAIIPDREVLYREFDElVtEECASHLPYIEQGMQLAEQ 
FKQKAIGLLQTATKQAEAAAPVVESKWRTLEAFWAKHMWNFISGIQYLAGLS 
TU>GNPAIAS1JVIAFTASITSPLTTQHTLLFN1LGGWVAAQLAPPSAASAFVGAG 

25 IAGAAVGSIGLGKVLVDILAGYGAGVAGALVAFKVMSGEMPSTEDLVNLLPA 
1LSPGALVVGVVCAAILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPE 
SDAAARVTQILSSLTITQLLKRLHQWINEDCSTPCSGSWLRDVWDWICTVLTD 
FKTWLQSKLLPRLPGVPFFSCQRGYKGVWRGDGIMQTTCPCGAQ1TGHVKNG 
SMRIVGPRTCSNTWHGTFPINAYTTGPCTPSPAPNYSRALWRVAAEEYVEVT 

30 RVGDFHYVTGMTTDNVKCPCQVPAPEFFTEVDGVRLHRYAPACKPLLREEV 
TFLVGLNQYLVGSQLPCEPEPDVAVLTSMLTDPSHITAETAKRRLARGSPPSL 
ASSSASQLSAPSLKATCTTRHDSPDADLIEANLLWRQEMGGNITRVESENKVV 
1LDSFEPLQAEEDEREVSVPAEILRRSRKFPRAMP1WARPDYNPPLLESWKDPD 
YVPPVVHGCPLPPAKAPPIPPPRRKRTVVLSESTVSSALAELATKTFGSSESSA 

35 VDSGTATASPDQPSDDGDAGSDVESYSSMPPLEGEPGDPDLSDGSWSTVSEE 
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ASEDVVCCSMSYTWTGALITPCAAEETK1.PINALSNSIXRHHNLVYATTSRSA 

SLRQKKVTEDRLQVU)DHYRDVLKEMKAKASTVKAKI^ 

ARSKFGYGAKDVRNl^SKAVNHIRSV^ 

PEKGGRKPARLIVFPDLGVRVCEKMALYDVVSTLPQAVMGSSYGFQYSPGQR 
5 VEFLVNAWKAKKCPMGFAYDTRCFDSTVTENDIRVEESIYQCCDLAPEARQA 
mSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAAAACRA 
AKLQDCTMLVCGDDLVV1CESAGTQEDEAS1JIAFTEAMTRYSAPPGDPPKPE 
YDIJELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETARHTPVNSWL 
GNDMYAPTLWARMIIJdTHFre 
10 RLHGLSAFSLHSYSPGEINRVASCLRKLGVPPLRVWRHRARSVRARLLSQGGR 
AATCGKYU^hTWAVRTKIJ^^ 
RWFMWCLLLLSVGVGIYLLPNR 

SEP. ID. NO. 2: 

15 gccagcccccgattgggggcgacactccaccatagatcactcccctgtgaggaactactgtcttcacgcagaaagcgtcta 
gccatggcgttagtatgagtgtcgtgcagcctccaggaccccccctcccgggagagccatagtggtctgcggaaccggtg 
agtacaccggaattgccaggacgaccgggtcctttcttggatcaacccgctcaatgcctggagatttgggcgtgcccccgcg 
agactgctagccgagtagtgttgggtcgcgaaaggccttgtggtactgcctgatagggtgcttgcgagtgccccgggaggt 
ctcgtagaccgtgcaccatgagcacgaatcctaaacctcaaagaaaaaccaaacgtaacaccaaccgccgcccacagga 

20 cgtcaagttcccgggcggtggtcagatcgtcggtggagtttacctgttgccgcgcaggggccccaggttgggtgtgcgcgc 
gactaggaagacttccgagcggtcgcaacctcgtggaaggcgacaacctatccccaaggctcgccagcccgagggtagg 
gcctgggctcagcccgggtacccctggcccctctatggcaatgagggcttggggtgggcaggatggctcctgtcaccccgt 
ggctctcggcctagttggggccccacggacccccggcgtaggtcgcgcaatttgggtaaggtcatcgataccctcacgtgc 
ggcttcgccgatctcatggggtacattccgctcgtcggcgcccccctagggggcgctgccagggccctggcgcatggcgt 

25 ccgggttctggaggacggcgtgaactatgcaacagggaatctgcccggttgctccttttctatcttccttttggctttgctgtcct 
gtttgaccatcccagcttccgcttatgaagtgcgcaacgtatccggagtgtaccatgtcacgaacgactgctccaacgcaag 
cattgtgtatgaggcagcggacatgatcatgcatacccccgggtgcgtgccctgcgttcgggagaacaactcctcccgctgc 
tgggtagcgctcactcccacgctcgcggccaggaacgctagcgtccccactacgacgatacgacgccatgtcgatttgctc 
gttggggcggctgctctctgctccgctatgtacgtgggagatctctgcggatctgltttcctcgtcgcccagctgttcaccttctc 

30 gcctcgccggcacgagacagtacaggactgcaattgctcaatatatcccggccacgtgacaggtcaccgtatggcttggga 
tatgatgatgaactggtcacctacagcagccctagtggtatcgcagttactccggatcccacaagctgtcgtggatatggtgg 
cgggggcccattggggagtcctagcgggccttgcctactattccatggtggggaactgggctaaggttctgattgtgatgcta 
ctctitgccggcgttgacgggggaacctatgtgacaggggggacgatggccaaaaacaccctcgggattacgtccctctttt 
cacccgggtcatcccagaaaatccagcttgtaaacaccaacggcagctggcacatcaacaggactgccctgaactgcaat 

35 gactccctcaacactgggttccttgctgcgctgttctacgtgcacaagttcaactcatctggatgcccagagcgcatggccag 
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ctgcagccccatcgacgcgttcgctcaggggtgggggcccatcacttacaatgagtcacacagctcggaccagaggcctta 
ttgttggcactacgcaccccggccgtgcggtatcgtacccgcggcgcaggtgtgtggtccagtgtactgcttcaccccaagc 
cctgtcgtggtggggacgaccgaccggttcggcgtccctacgtacagttggggggagaatgagacggacgtgctgcttctt 
aacaacacgcggccgccgcaaggcaactggtttggctgtacatggatgaatagcactgggttcaccaagacgtgcggggg 
5 ccccccgtgtaacatcggggggatcggcaataaaaccttgacctgccccacggactgcttccggaagcaccccgaggcca 
cttacaccaagtgtggttcggggccttggttgacacccagatgcttggtccactacccatacaggctttggcactacccctgc 
actgtcaactttaccatcttcaaggttaggatgtacgtggggggagtggagcacaggctcgaagccgcatgcaattggactc 
gaggagagcgttgtaacctggaggacagggacagatcagagcttagcccgctgctgctgtctacaacggagtggcaggta 
ttgccctgttccttcaccaccctaccggctctgtccactggtttgatccatctccatcagaacgtcgtggacgtacaatacctgt 

10 acggtatagggtcggcggttgtctcctttgcaatcaaatgggagtatgtcctgttgctcttccttcttctggcggacgcgcgcgt 
ctgtgcctgcttgtggatgatgctgctgatagctcaagctgaggccgccctagagaacctggtggtcctcaacgcggcatcc 
gtggccggggcgcatggcattctctccttcctcgtgttcttctgtgctgcctggtacatcaagggcaggctggtccctggggc 
ggcatatgccctctacggcgtatggccgctactcctgctcctgctggcgttaccaccacgagcatacgccatggaccggga 
gatggcagcatcgtgcggaggcgcggttttcgtaggtctgatactcttgaccttgtcaccgcactataagctgttcctcgctag 

15 gctcatatggtggttacaatattttatcaccagggccgaggcacacttgcaagtgtggatcccccccctcaacgttcgggggg 
gccgcgatgccgtcatcctcctcacgtgcgcgatccacccagagctaatctttaccatcaccaaaatcttgctcgccatactc 
ggtccactcatggtgctccaggctggtataaccaaagtgccgtacttcgtgcgcgcacacgggctcattcgtgcatgcatgct 
ggtgcggaaggttgctgggggtcattatgtccaaatggctctcatgaagttggccgcactgacaggtacgtacgtttatgacc 
atctcaccccactgcgggactgggcccacgcgggcctacgagaccttgcggtggcagttgagcccgtcgtcttctctgatat 

20 ggagaccaaggttatcacctggggggcagacaccgcggcgtgtggggacatcatcttgggcctgcccgtctccgcccgca 
gggggagggagatacatctgggaccggcagacagccttgaagggcaggggtggcgactcctcgcgcctattacggccta 
ctcccaacagacgcgaggcctacttggctgcatcatcactagcctcacaggccgggacaggaaccaggtcgagggggag 
gtccaagtggtctccaccgcaacacaatctttcctggcgacctgcgtcaatggcgtgtgttggactgtctatcatggtgccgg 
ctcaaagacccttgccggcccaaagggcccaatcacccaaatgtacaccaatgtggaccaggacctcgtcggctggcaag 

25 cgccccccggggcgcgttccttgacaccatgcacctgcggcagctcggacctttacttggtcacgaggcatgccgatgtcat 
tccggtgcgccggcggggcgacagcagggggagcctactctcccccaggcccgtctcctacttgaagggctcttcgggc 
ggtccactgctctgcccctcggggcacgctgtgggcatctttcgggctgccgtgtgcacccgaggggttgcgaaggcggtg 
gactttgtacccgtcgagtctatggaaaccactatgcggtccccggtcttcacggacaactcgtcccctccggccgtaccgc 
agacattccaggtggcccatctacacgcccctactggiagcggcaagagcactaaggtgccggctgcgtatgcagcccaa 

30 gggtataaggtgcttgtcctgaacccgtccgtcgccgccaccctaggtttcggggcgtatatgtctaaggcacatggtatcga 
ccctaacatcagaaccggggtaaggaccatcaccacgggtgcccccatcacgtactccacctatggcaagtttcttgccgac 
ggtgg tt g ctct ggggg c g cctat g acatcataatat gtg a tgagtgccactcaactgactcgaccactatcctgggcatcgg 
cacagtcctggaccaagcggagacggctggagcgcgactcgtcgtgctcgccaccgctacgcctccgggatcggtcacc 
gtgccacatccaaacatcgaggaggtggctctgtccagcactggagaaatccccttttatggcaaagccatccccatcgaga 

35 ccatcaagggggggaggcacctcattttctgccattccaagaagaaatgtgatgagctcgccgcgaagctglccggcctcg 
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gactcaatgctgtagcatattaccggggccttgatgtatccgtcataccaactagcggagacgtcattgtcgtagcaacggac 
gctctaatgacgggctttaccggcgatttcgactcagtgatcgactgcaatacatgtgtcacccagacagtcgacttcagcct 
ggacccgaccttcaccattgagacgacgaccgtgccacaagacgcggtgtcacgctcgcagcggcgaggcaggactggt 
aggggcaggatgggcatttacaggtttgtgactccaggagaacggccctcgggcatgttcgattcctcggttctgtgcgagt 
5 gctatgacgcgggctgtgcttggtacgagctcacgcccgccgagacctcagttaggttgcgggcttacctaaacacaccag 
ggttgcccgtctgccaggaccatctggagttctgggagagcgtctttacaggcctcacccacatagacgcccatttcttgtcc 
cagactaagcaggcaggagacaacttcccctacctggtagcataccaggctacggtgtgcgccagggctcaggctccacc 
tccatcgtgggaccaaatgtggaagtgtctcatacggctaaagcctacgctgcacgggccaacgcccctgctgtataggctg 
ggagccgttcaaaacgaggttactaccacacaccccataaccaaatacatcatggcatgcatgtcggctgacctggaggtc 

10 gtcacgagcacctgggtgctggtaggcggagtcctagcagctctggccgcgtattgcctgacaacaggcagcgtggtcatt 
gtgggcaggatcatcttgtccggaaagccggccatcattcccgacagggaagtcctttaccgggagttcgatgagatggaa 
gagtgcgcctcacacctcccttacatcgaacagggaatgcagctcgccgaacaattcaaacagaaggcaatcgggttgctg 
caaacagccaccaagcaagcggaggctgctgctcccgtggtggaatccaagtggcggaccctcgaagccttctgggcga 
agcatatgtggaatttcatcagcgggatacaatatttagcaggcttgtccactctgcctggcaaccccgcgatagcatcactga 

15 tggcattcacagcctctatcaccagcccgctcaccacccaacataccctcctgtttaacatcctggggggatgggtggccgc 
ccaacttgctcctcccagcgctgcttctgctttcgtaggcgccggcatcgctggagcggctgttggcagcataggccttggg 
aaggtgcttgtggatattttggcaggttatggagcaggggtggcaggcgcgctcgtggcctttaaggtcatgagcggcgag 
atgccctccaccgaggacctggttaacctactccctgctatcctctcccctggcgccctagtcgtcggggtcgtgtgcgcagc 
gatactgcgtcggcacgtgggcccaggggagggggctgtgcagtggatgaaccggctgatagcgttcgcttcgcggggta 

20 accacgtctcccccacgcactatgtgcctgagagcgacgctgcagcacgtgtcactcagatcctctctagtcttaccatcact 
cagctgctgaagaggcttcaccagtggatcaacgaggactgctccacgccatgctccggctcgtggctaagagatgtttgg 
gattggatatgcacggtgttgactgatttcaagacctggctccagtccaagctcctgccgcgattgccgggagtccccttcttc 
tcatgtcaacgtgggtacaagggagtctggcggggcgacggcatcatgcaaaccacctgcccatgtggagcacagatcac 
cggacatgtgaaaaacggttccatgaggatcgtggggcctaggacctgtagtaacacgtggcatggaacattccccattaac 

25 gcgtacaccacgggcccctgcacgccctccccggcgccaaattattctagggcgctgtggcgggtggctgctgaggagta 
cgtggaggttacgcgggtgggggatttccactacgtgacgggcatgaccactgacaacgtaaagtgcccgtgtcaggttcc 
ggcccccgaattcttcacagaagtggatggggtgcggttgcacaggtacgctccagcgtgcaaacccctcctacgggagg 
aggtcacattcctggtcgggctcaatcaatacctggttgggtcacagctcccatgcgagcccgaaccggacgtagcagtgct 
cacttccatgctcaccgacccctcccacattacggcggagacggctaagcgtaggctggccaggggatctcccccctcctt 

30 ggccagctcatcagctagccagctgtctgcgccttccttgaaggcaacatgcactacccgtcatgactccccggacgctgac 
ctcatcgaggccaacctcctgtggcggcaggagatgggcgggaacatcacccgcgtggagtcagaaaataaggtagtaat 
tttggactctttcgagccgctccaagcggaggaggatgagagggaagtatccgttccggcggagatcctgcggaggtcca 
ggaaattccctcgagcgatgcccatatgggcacgcccggattacaaccctccactgttagagtcctggaaggacccggact 
acgtccctccagtggtacacgggtgtccattgccgcctgccaaggcccctccgataccacctccacggaggaagaggacg 

35 gttgtcctgtcagaatctaccgtgtcttctgcctiggcggagctcgccacaaagaccttcggcagctccgaatcgtcggccgt 
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cgacagcggcacggcaacggcctctcctgaccagccctccgacgacggcgacgcgggatccgacgttgagtcgtactcc 
tccatgcccccccttgagggggagccgggggatcccgatctcagcgacgggtcttggtctaccgtaagcgaggaggctag 
tgaggacgtcgtctgctgctcgatgtcctacacatggacaggcgccctgatcacgccatgcgctgcggaggaaaccaagct 
gcccatcaatgcactgagcaactctttgctccgtcaccacaacttggtctatgctacaacatctcgcagcgcaagcctgcggc 
5 agaagaaggtcacctttgacagactgcaggtcctggacgaccactaccgggacgtgctcaaggagatgaaggcgaaggc 
gtccacagttaaggctaaacttctatccgtggaggaagcctgtaagctgacgcccccacattcggccagatctaaatttggct 
at ggggcaaaggacgtccggaacctatccagcaaggccgttaaccacatccgctccgtgtggaaggacttgctggaagac 
actgagacaccaattgacaccaccatcatggcaaaaaatgaggttttctgcgtccaaccagagaaggggggccgcaagcc 
agctcgccttatcgtattcccagatttgggggttcgtgtgtgcgagaaaatggccctttacgatgtggtctccaccctccctcag 

10 gccgtgatgggctcttcatacggattccaatactctcctggacagcgggtcgagttcctggtgaatgcctggaaagcgaaga 
aatgccctatgggcttcgcatatgacacccgctgttttgactcaacggtcactgagaatgacatccgtgttgaggagtcaatct 
accaatgttgtgacttggcccccgaagccagacaggccataaggtcgctcacagagcggctttacatcgggggccccctga 
ctaattctaaagggcagaactgcggctatcgccggtgccgcgcgagcggtgtactgacgaccagctgcggtaataccctca 
catgttacttgaaggccgctgcggcctgtcgagctgcgaagctccaggactgcacgatgctcgtatgcggagacgaccttgt 

15 cgttatctgtgaaagcgcggggacccaagaggacgaggcgagcctacgggccttcacggaggctatgactagatactctg 
ccccccctggggacccgcccaaaccagaatacgacttggagttgataacatcatgctcctccaatgtgtcagtcgcgcacg 
atgcatctggcaaaagggtgtactatctcacccgtgaccccaccaccccccttgcgcgggctgcgtgggagacagctagac 
acactccagtcaattcctggctaggcaacatcatcatgtatgcgcccaccttgtgggcaaggatgatcctgatgactcatttctt 
ctccatccttctagctcaggaacaacttgaaaaagccctagattgtcagatctacggggcctgttactccattgagccacttga 

20 cctacctcagatcattcaacgactccatggccttagcgcattttcactccatagttactctccaggtgagatcaatagggtggct 
tcatgcctcaggaaacttggggtaccgcccttgcgagtctggagacatcgggccagaagtgtccgcgctaggctactgtcc 
ca ggggggg a ggg ct g ccactt g t gg caa g tacct c t tcaactgggcagtaaggaccaagctcaaactcactccaatcccg 
gctgcgtcccagttggatttatccagctggttcgttgctggttacagcgggggagacatatatcacagcctgtctcgtgcccga 
ccccgctggttcatgtggtgcctactcctactttctgtaggggtaggcatctatctactccccaaccgatgaacggggagctaa 

25 acactccaggccaataggccatcctgtttttttccctttttttttttcttttttttttttttttttttttttttttttttttctcctttttttttcctcttttt 
ttccttttctttcctttggtggctccatcttagccctagtcacggctagctgtgaaaggtccgtgagccgcttgactgcagagagt 
gctgatactggcctctctgcagatcaagt 

Other embodiments are within the following claims. While several 
30 embodiments have been shown and described, various modifications may be made 
without departing from the spirit and scope of the present invention. 
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WHAT IS CLAIMED IS: 

1. A nucleic acid molecule comprising a region selected from the 
group consisting of: 

5 a) an altered HCV NS3 encoding region coding for one or more 

NS3 mutations, wherein at least one of said NS3 mutations, identified by reference to 
the amino acid sequence numbering of SEQ. ID. NO. 1, is selected from the group 
consisting of: 

amino acid 1095 being Ala, 
10 amino acid 1202 being Gly, and 
amino acid 1347 being Thr; 

b) an altered HCV NS5A encoding region coding for one or more 
NS5A mutations, wherein at least one of said NS5A mutations, identified by reference 
to the amino acid sequence numbering of SEQ. ID. NO. 1, is selected from the group 

15 consisting of: 

amino acid 2041 being Thr, 

a Lys insertion between residue 2039 and 2040. 

amino acid 2173 being Phe, 

amino acid 2197 being Phe, 
20 amino acid 2198 being Ser, 

amino acid 2199 being Thr, and 

amino acid 2204 being Arg; and 

c) an altered encephalomyocarditis virus (EMCV) internal 
ribosome entry site (IRES) region containing one or more EMCV IRES mutations, 

25 wherein at least one of said EMCV IRES mutations, identified by reference to the 
nucleotide number of SEQ. ID. NO. 3, is an insertion at nucleotide 1736 of adenine. 

2. The nucleic acid molecule of claim 1, wherein said nucleic acid 
molecule comprises said NS5A encoding region. 



30 



3. The nucleic acid molecule of claim 2, wherein at least two of 
said NS5A adaptive mutations are present. 
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4. The nucleic acid molecule of claim 2, further comprising a 
region encoding for a HCV NS3 region, wherein said NS3 region may be the same or 
different than said altered NS3 region. 

5 5. The nucleic acid molecule of claim 4, wherein said nucleic acid 

molecule is an HCV replicon comprising a HCV 5' UTR-PC region, said NS3 
encoding region, an HCV NS4A encoding region, an HCV NS4B encoding region, 
said NS5A encoding region, an HCV NS5B encoding region, and a HCV 3' UTR. 

10 6. The nucleic acid molecule of claim 5, wherein said HCV 

replicon further comprises a sequence encoding for a reporter protein. 

7. The nucleic acid molecule of claim 5, wherein said HCV 
replicon further comprises a sequence encoding for a selection protein. 

15 

8. The nucleic acid molecule of claim 5, wherein said HCV 
replicon further comprises a HCV core encoding region, a HCV El encoding region, a 
HCV E2 encoding region, a HCV p7 encoding region, and a HCV NS2 encoding 
region. 

20 

9. A nucleic acid molecule comprising a region selected from the 
group consisting of: 

a) an altered HCV NS3 encoding region containing one or more 
NS3 mutations, wherein at least one of said NS3 mutations, identified by reference to 

25 the nucleotide numbering of SEQ. ID. NO. 2, is selected from the group consisting of: 
nucleotide 3625 being cytosine, 
nucleotide 3946 being guanine, 
nucleotide 4380 being adenine, 

b) an altered HCV NS5A encoding region containing one or more 
30 NS5A mutations, wherein at least one of said NS5A mutations, identified by reference 

to the nucleotide numbering of SEQ. ID. NO. 2, is selected from the group consisting 
of: 

an insertion of 3 adenine residues between nucleotide 6458 and 6459, 
nucleotide 6463 being cytosine, 
35 nucleotide 6859 being thymine or uracil, 
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nucleotide 6931 being thymine or uracil, 
nucleotide 6934 being cytosine, 
nucleotide 6936 being adenine, and 
nucleotide 6953 being adenine or guanine; and 
5 c) an altered encephalomyocarditis virus (EMCV) internal 

ribosome entry site (IRES) region containing one or more EMCV ERES mutations, 
wherein at least one of said EMCV IRES mutations, identified by reference to the 
nucleotide number of SEQ. ID. NO. 3, is an insertion at nucleotide 1736 of adenine. 

10. The nucleic acid molecule of claim 9, wherein said molecule 
comprises said altered NS5A encoding region, and the nucleotide sequence of said 
altered NS5A region is provided for by bases 6258-7598 of SEQ. ID. NO. 2, or the 
RNA version thereof, modified with one or more of said NS5A modifications selected 
from the group consisting of: 

an insertion of 3 adenine residues between nucleotide 6458 and 6459, 
nucleotide 6463 being cytosine, 
nucleotide 6859 being thymine or uracil, 
nucleotide 6931 being thymine or uracil, 
nucleotide 6934 being cytosine, 
nucleotide 6936 being adenine, and 
nucleotide 6953 being adenine or guanine. 

11. The nucleic acid molecule of claim 10, wherein said molecule 
is an HCV replicon comprising a HCV 5' UTR-PC region, a modified HCV NS3- 

25 NS5B region, and a HCV 3' UTR, wherein said modified NS3-NS5B region 
comprises said altered NS5A region. 

12. The nucleic acid molecule of claim 11, wherein said 5' UTR- 
PC region is the RNA version of bases 1-377 of SEQ. ID. NO. 2 and said 3' UTR is 

30 the RNA version of bases 9374-9605 of SEQ. ID. NO. 2. 

13. The nucleic acid molecule of claim 10, wherein said molecule 
is an HCV replicon comprising a HCV 5' UTR-PC region, a modified HCV NS3- 
NS5B region, and a HCV 3' UTR, wherein 

35 said 5* UTR-PC region is the RNA version of bases 1-377 of SEQ. ID. NO. 2; 

42 
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said 3' UTR is the RNA version of bases 9374-9605 of SEQ. ED. NO. 2; and 
said modified NS3-NS5B region consists of the RNA version of bases 3420-9371 of 
SEQ. ED. NO. 2 modified with one or more modifications selected from the group 
consisting of: 
5 nucleotide 4380 being adenine, 
nucleotide 3625 being cytosine, 
nucleotide 3946 being guanine, 

an insertion of 3 adenine residues between nucleotide 6458 and nucleotide 6459, 
nucleotide 6463 being cytosine, 
10 nucleotide 6859 being uracil, 
nucleotide 6931 being uracil, 
nucleotide 6934 being cytosine, 
nucleotide 6936 being adenine, and 
nucleotide 6953 being adenine or guanine. 

15 

14. The nucleic acid molecule of claim 13, wherein said replicon is 
a genomic replicon that further comprises the RNA version of nucleotides 378-3419 
of SEQ. ID. NO. 2. 

20 15. A nucleic acid molecule comprising the nucleic acid base 

sequence of bases 1-7989 of SEQ. ED. NO. 3, or the RNA version thereof, consisting 
of one or more different modifications selected from the group consisting of: 

a) nucleotides 5335-5337 modified to code for arginine; 

b) nucleotides 5242-5244 modified to code for phenylalanine; 
25 c) nucleotides 5314-5316 modified to code for phenylalanine; 

d) nucleotides 5317-5319 modified to code for serine; 

e) nucleotides coding for lysine inserted after nucleotide 4843; 

0 nucleotides 2329-2331 modified to code for glycine, nucleotides 2764-2766 
modified to code for threonine, nucleotides 5242-5244 modified to code for 
30 phenylalanine, and an extra adenosine inserted after nucleotide 1736; 

g) nucleotides 4846-4848 modified to code for threonine, and nucleotides 5242-5244 
modified to modified to code for phenylalanine; 

h) nucleotides 4846-4848 modified to code for threonine, and nucleotides 5314-5316 
modified to code for phenylalanine; 
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i) nucleotides 4846-4848 modified to code for threonine, and nucleotides 5317-5319 
modified to code for serine; 

j) nucleotides 2329-2331 modified to code for glycine, and nucleotides coding for 
lysine inserted after nucleotides 4843; 
5 k) nucleotides 5314-5316 modified to code for phenylalanine and nucleotides 5320- 
5322 modified to code for threonine; 

1) nucleotides 4846-4848 modified to code for threonine, nucleotides 5314-5316 
modified to code for phenylalanine, and nucleotides 5320-5322 modified to code for 
threonine; 

10 m) nucleotides 4846-4848 modified to code for threonine, nucleotides 5314-53 16 
modified to code for phenylalanine, and an extra adenosine inserted after nucleotide 
1736; and 

n) nucleotides 53 14-5316 modified to code for phenylalanine, nucleotides 5320-5322 
modified to code for threonine, and an extra adenosine inserted after nucleotide 1736; 
15 and 

0) nucleotides 5320-5322 modified to code for threonine. 

16. The nucleic acid of claim 15, wherein said one or more 
different modifications is selected from the group consisting of: 
20 a) C5337A; 

b) C5243TorU; 

c) C5315TorU; 

d) TorU5318C; 

e) AAA inserted after 4843; 

25 f) A2330G, G2764A, C5243T or U, and adenosine inserted 1736; 

g) A4847C and C5243T or U; 

h) A4847C and C53 1 5T or U; 

1) A4847C and T or U53 1 8C; 

j) A2330G and AAA inserted after 4843; 
30 k) C5315TorUandG5320A; 

1) A4847C,C5315TorU, and G5320A; 
m) A4847C, C5315T or U, and adenosine inserted 1736; 
n) C5315T or U, G5320A and adenosine inserted 1736; and 
o) G5320A. 
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17. The nucleic acid of claim 16, wherein said nucleic acid is RNA 
and comprises said nucleic acid base sequence. 

5 18. The nucleic acid of claim 17, wherein said nucleic acid is RNA 

and consists of said nucleic acid base sequence. 

19. An expression vector comprising a nucleotide sequence coding 
for the nucleic acid molecule of any one of claims 1-18, wherein said nucleotide 

10 sequence is transcriptionally coupled to an exogenous promoter. 

20. A recombinant cell human hepatoma cell, wherein said cell 
comprises the nucleic acid of any one of claims 5-8 and 1 1-18. 

15 21. The recombinant cell of claim 20, wherein said hepatoma cell 

is an Huh-7 cell. 

22. The recombinant cell of claim 20, wherein said cell is derived 
from a Huh-7 cell. 

20 

23. A recombinant cell made by a process comprising the step of 
introducing into a human hepatoma cell the nucleic acid of any one of claims 5-8 and 
11-18. 

25 24. A method of making an HCV replicon enhanced cell 

comprising the steps of: 

a) introducing and maintaining a HCV replicon in a cell; and 

b) curing said cell of said HCV replicon to produce said replicon 

enhanced cell. 

30 

25. The method of claim 24, wherein said cell is a human 

hepatoma cell. 

26. The method of claim 24, wherein said ceil is a Huh-7 cell or is 
35 derived from a Huh-7 cell. 
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27. The method of claim 26, further comprising the step of 
confirming the ability of said replicon enhanced cell to maintain an HCV replicon. 

5 28 A method of making an HCV replicon enhanced cell containing 

a functional HCV replicon comprising the steps of: 

a) introducing and maintaining a first HCV replicon in a cell; 

b) curing said cell of said first replicon to produce a cured cell; 

and 

10 c) introducing and maintaining a second HCV replicon into said 

cured cell, wherein said second HCV replicon may be the same or different than said 
first HCV replicon. 

29 The method of claim 28, wherein said cell is a human 

15 hepatoma cell. 

30. The method of claim 29, wherein said human hepatoma cell is 

a Huh-7 cell. 

20 31. The method of claim 30, wherein said human hepatoma cell is 

derived from a Huh-7 cell. 

32. An HCV replicon enhanced cell made by the method of any 
one of claims 24-27. 

25 

33. An HCV replicon enhanced cell containing a HCV replicon 
made by the method of any one of claims 28-3 1 . 

34. A method of measuring the ability of a compound to affect 
30 HCV activity comprising the steps of: 

a) providing said compound to the HCV replicon enhanced cell of 

claim 33; and 

b) measuring the ability of said compound to effect one or more 
replicon activities as a measure of the effect on HCV activity. 

35 
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35. The method of claim 34, wherein said compound is a ribozyme. 

36. The method of claim 34, wherein said compound in an 
antisense nucleic acid. 

5 

37. The method of claim 34, wherein compound is an organic 

compound. 

38. The method of claim 34, wherein said step (b) measures HCV 
10 protein production. 

39. The method of claim 33, wherein said step (b) measures 
production of RNA transcripts. 
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1 


GCCAGCCCCC 


GATTGGGGGC 


GACACTCCAC 


CATAGATCAC 


TCCCCTGTGA 


51 


GGAACTACTG 


TCTTCACGCA 


GAAAGCGTCT 


AGCCATGGCG 


TTAGTATGAG 


101 


TGTCGTGCAG 


CCTCCAGGAC 


CCCCCCTCCC 


GGGAGAGCCA 


TAGTGGTCTG 


151 


CGGAACCGGT 


GAGTACACCG 


GAATTGCCAG 


GACGACCGGG 


TCCTTTCTTG 


201 


GATCAACCCG 


CTCAATGCCT 


GGAGATTTGG 


GCGTGCCCCC 


GCGAGACTGC 


251 


TAGCCGAGTA 


GTGTTGGGTC 


GCGAAAGGCC 


TTGTGGTACT 


GCCTGATAGG 


301 


GTGCTTGCGA 


GTGCCCCGGG 


AGGTCTCGTA 


GACCGTGCAC 


CATGAGCACG 


351 


AATCCTAAAC 


CTCAAAGAAA AACCAAAGGG 


CGCGCCATGA TTGAACAAGA 


401 


TGGATTGCAC 


GCAGGTTCTC 


CGGCCGCTTG 


GGTGGAGAGG 


CTATTCGGCT 


451 


ATGACTGGGC 


ACAACAGACA ATCGGCTGCT 


CTGATGCCGC 


CGTGTTCCGG 


501 


CTGTCAGCGC 


AGGGGCGCCC 


GGTTCTTTTT 


GTCAAGACCG 


ACCTGTCCGG 


551 


TGCCCTGAAT 


GAACTGCAGG 


ACGAGGCAGC 


GCGGCTATCG 


TGGCTGGCCA 


601 


CGACGGGCGT 


TCCTTGCGCA 


GCTGTGCTCG 


ACGTTGTCAC 


TGAAGCGGGA 


651 


AGGGACTGGC 


TGCTATTGGG 


CGAAGTGCCG 


GGGCAGGATC 


TCCTGTCATC 


701 


TCACCTTGCT 


CCTGCCGAGA 


AAGTATCCAT 


CATGGCTGAT 


GCAATGCGGC 


751 


GGCTGCATAC 


GCTTGATCCG 


GCTACCTGCC 


CATTCGACCA 


CCAAGCGAAA 


801 


CATCGCATCG 


AGCGAGCACG 


TACTCGGATG 


GAAGCCGGTC 


TTGTCGATCA 


851 


GGATGATCTG 


GACGAAGAGC 


ATCAGGGGCT 


CGCGCCAGCC 


GAACTGTTCG 


901 


CCAGGCTCAA 


GGCGCGCATG 


CCCGACGGCG 


AGGATCTCGT 


CGTGACCCAT 


951 


GGCGATGCCT 


GCTTGCCGAA 


TATCATGGTG 


GAAAATGGCC 


GCTTTTCTGG 


1001 


ATTCATCGAC 


TGTGGCCGGC 


TGGGTGTGGC 


GGACCGCTAT 


CAGGACATAG 


1051 


CGTTGGCTAC 


CCGTGATATT 


GCTGAAGAGC 


TTGGCGGCGA 


ATGGGCTGAC 


1101 


CGCTTCCTCG 


TGCTTTACGG 


TATCGCCGCT 


CCCGATTCGC 


AGCGCATCGC 


1151 


CTTCTATCGC 


CTTCTTGACG 


AGTTCTTCTG 


AGTTTAAACA 


GACCACAACG 


1201 


GTTTCCCTCT 


AGCGGGATCA 


ATTCCGCCCC 


TCTCCCTCCC 


CCCCCCCTAA 


1251 


CGTTACTGGC 


CGAAGCCGCT 


TGGAATAAGG 


CCGGTGTGCG 


TTTGTCTATA ■ 


1301 


tc tt a r r r v r T"vr' 

lul Xnl i. 1 1L 


CACCATATTG 


CCGTCTTTTG 


GCAATGTGAG 


GGCCCGGAAA 


1351 


CCTGGCCCTG 


TCTTCTTGAC 


GAGCATTCCT 


AGGGGTCTTT 


CCCCTCTCGC 


1401 


CAAAGGAATG 


CAAGGTCTGT 


TGAATGTCGT 


GAAGGAAGCA 


GTTCCTCTGG 


1451 


AAGCTTCTTG 


AAGACAAACA 


ACGTCTGTAG 


CGACCCTTTG 


CAGGCAGCGG 


1501 


AACCCCCCAC 


CTGGCGACAG 


GTGCCTCTGC 


GGCCAAAAGC 


CACGTGTATA 
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1551 


AGATACACCT 


GCAAAGGCGG 


CACAACCCCA 


GTGCCACGTT 


GTGAGTTGGA 


1601 


TAGTTGTGGA 


AAGAGTCAAA 


TGGCTCTCCT 


CAAGCGTATT 


CAACAAGGGG 


1651 


CTGAAGGATG 


CCCAGAAGGT 


ACCCCATTGT 


ATGGGATCTG 


ATCTGGGGCC 


1701 


TCGGTGCACA 


TGCTTTACAT 


GTGTTTAGTC 


GAGGTTAAAA 


AACGTCTAGG 


1751 


CCCCCCGAAC 


CACGGGGACG 


TGGTTTTCCT 


TTGAAAAACA 


CGATAATACC 


1801 


ATGGCGCCTA 


TTACGGCCTA 


CTCCCAACAG 


ACGCGAGGCC 


TACTTGGCTG 


1851 


CATCATCACT 


AGCCTCACAG 


GCCGGGACAG 


GAACCAGGTC 


GAGGGGGAGG 


1901 


TCCAAGTGGT 


CTCCACCGCA 


ACACAATCTT 


TCCTGGCGAC 


CTGCGTCAAT 


1951 


GGCGTGTGTT 


GGACTGTCTA 


TCATGGTGCC 


GGCTCAAAGA 


CCCTTGCCGG 


2001 


CCCAAAGGGC 


CCAATCACCC 


AAATGTACAC 


CAATGTGGAC 


CAGGACCTCG 


2051 


TCGGCTGGCA AGCGCCCCCC 


GGGGCGCGTT 


CCTTGACACC 


ATGCACCTGC 


2101 


GGCAGCTCGG 


ACCTTTACTT 


GGTCACGAGG 


CATGCCGATG 


TCATTCCGGT 


2151 


GCGCCGGCGG 


GGCGACAGCA 


GGGGGAGCCT 


ACTCTCCCCC 


AGGCCCGTCT 


2201 


CCTACTTGAA 


GGGCTCTTCG 


GGCGGTCCAC 


TGCTCTGCCC 


CTCGGGGCAC 


2251 


GCTGTGGGCA 


TCTTTCGGGC 


TGCCGTGTGC 


ACCCGAGGGG 


TTGCGAAGGC 


2301 


GGTGGACTTT 


GTACCCGTCG 


AGTCTATGGA 


AACCACTATG 


CGGTCCCCGG 


2351 


TCTTCACGGA 


CAACTCGTCC 


CCTCCGGCCG 


TACCGCAGAC 


ATTCCAGGTG 


2401 


GCCCATCTAC 


ACGCCCCTAC 


TGGTAGCGGC 


AAGAGCACTA 


AGGTGCCGGC 


2451 


TGCGTATGCA 


GCCCAAGGGT 


ATAAGGTGCT 


TGTCCTGAAC 


CCGTCCGTCG 


2501 


CCGCCACCCT 


AGGTTTCGGG 


GCGTATATGT 


C TAAGGC AC A 


TGGTATCGAC 


2551 


CCTAACATCA 


GAACCGGGGT 


AAGGACCATC 


ACCACGGGTG 


CCCCCATCAC 


2601 


GTACTCCACC 


TATGGCAAGT 


TTCTTGCCGA 


CGGTGGTTGC 


TCTGGGGGCG 


2651 


CCTATGACAT 


CATAATATGT 


GATGAGTGCC 


ACTCAACTGA 


CTCGACCACT 


2701 


ATCCTGGGCA 


TCGGCACAGT 


CCTGGACCAA 


GCGGAGACGG 


CTGGAGCGCG 


2751 


ACTCGTCGTG 


CTCGCCACCG 


CTACGCCTCC 


GGGATCGGTC 


ACCGTGCCAC 


2801 


ATCCAAACAT 


CGAGGAGGTG 


GCTCTGTCCA 


GCACTGGAGA 


AATCCCCTTT 


2851 


TATGGCAAAG 


CCATCCCCAT 


CGAGACCATC 


a Annnnnnn a 


P.P.P APPTP AT 


2901 


TTTCTGCCAT 


TCCAAGAAGA 


AATGTGATGA 


GCTCGCCGCG 


AAGCTGTCCG 


2951 


GCCTCGGACT 


CAATGCTGTA 


GCATATTACC 


GGGGCCTTGA 


TGTATCCGTC 


3001 


ATACCAACTA 


GCGGAGACGT 


CATTGTCGTA 


GCAACGGACG 


CTCTAATGAC 


3051 


GGGCTTTACC 


GGCGATTTCG 


ACTCAGTGAT 


CGACTGCAAT 


ACATGTGTCA 
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3101 


CCCAGACAGT 


CGACTTCAGC 


CTGGACCCGA 


CCTTCACCAT 


TGAGACGACG 


3151 


ACCGTGCCAC 


AAGACGCGGT 


GTCACGCTCG 


CAGCGGCGAG 


GCAGGACTGG 


3201 


TAGGGGCAGG 


ATGGGCATTT 


ACAGGTTTGT 


GACTCCAGGA 


GAACGGCCCT 


3251 


CGGGCATGTT 


CGATTCCTCG 


GTTCTGTGCG 


AGTGCTATGA 


CGCGGGCTGT 


3301 


GCTTGGTACG 


AGCTCACGCC 


CGCCGAGACC 


TCAGTTAGGT 


TGCGGGCTTA 


3351 


CCTAAACACA 


CCAGGGTTGC 


CCGTCTGCCA GGACCATCTG 


GAGTTCTGGG 


3401 


AGAGCGTCTT 


TACAGGCCTC 


ACCCACATAG 


ACGCCCATTT 


CTTGTCCCAG 


3451 


ACTAAGCAGG 


CAGGAGACAA 


CTTCCCCTAC 


CTGGTAGCAT 


ACCAGGCTAC 


3501 


GGTGTGCGCC 


AGGGCTCAGG 


CTCCACCTCC 


ATCGTGGGAC 


CAAATGTGGA 


3551 


AGTGTCTCAT 


ACGGCTAAAG 


CCTACGCTGC 


ACGGGCCAAC 


GCCCCTGCTG 


3601 


TATAGGCTGG 


GAGCCGTTCA 


AAACGAGGTT 


ACTACCACAC 


ACCCCATAAC 


3651 


CAAATACATC 


ATGGCATGCA 


TGTCGGCTGA 


CCTGGAGGTC 


GTCACGAGCA 


3701 


CCTGGGTGCT 


GGTAGGCGGA 


GTCCTAGCAG 


CTCTGGCCGC 


GTATTGCCTG 


3751 


ACAACAGGCA 


GCGTGGTCAT 


TGTGGGCAGG 


ATCATCTTGT 


CCGGAAAGCC 


3801 


GGCCATCATT 


CCCGACAGGG 


AAGTCCTTTA 


CCGGGAGTTC 


GATGAGATGG 


3851 


AAGAGTGCGC 


CTCACACCTC 


CCTTACATCG 


AACAGGGAAT 


GCAGCTCGCC 


3901 


GAACAATTCA 


AACAGAAGGC 


AATCGGGTTG 


CTGCAAACAG 


CCACCAAGCA 


3951 


AGCGGAGGCT 


GCTGCTCCCG 


TGGTGGAATC 


CAAGTGGCGG 


ACCCTCGAAG 


4001 


CCTTCTGGGC 


GAAGCATATG 


TGGAATTTCA 


TCAGCGGGAT 


ACAATATTTA 


4051 


GCAGGCTTGT 


CCACTCTGCC 


TGGCAACCCC 


GCGATAGCAT 


CACTGATGGC 


4101 


ATTCACAGCC 


TCTATCACCA 


GCCCGCTCAC 


CACCCAACAT 


ACCCTCCTGT 


4151 


TTAACATCCT 


GGGGGGATGG 


GTGGCCGCCC 


AACTTGCTCC 


TCCCAGCGCT 


4201 


GCTTCTGCTT 


TCGTAGGCGC 


CGGCATCGCT 


GGAGCGGCTG 


TTGGCAGCAT 


4251 


AGGCCTTGGG 


AAGGTGCTTG 


TGGATATTTT 


GGCAGGTTAT 


GGAGC AGGGG 


4301 


TGGCAGGCGC 


GCTCGTGGCC 


TTTAAGGTCA 


TGAGCGGCGA 


GATGCCCTCC 


4351 


ACCGAGGACC 


TGGTTAACCT 


ACTCCCTGCT 


ATCCTCTCCC 


CTGGCGCCCT 


dam 






CAGCGATACT 


GCGTCGGCAC 




4451 


GGGAGGGGGC 


TGTGCAGTGG 


ATGAACCGGC 


TGATAGCGTT 


CGCTTCGCGG 


4501 


GGTAACCACG 


TCTCCCCCAC 


GCACTATGTG 


CCTGAGAGCG 


ACGCTGCAGC 


4551 


ACGTGTCACT 


CAGATCCTCT 


CTAGTCTTAC 


CATCACTCAG 


CTGCTGAAGA 


4601 


GGCTTCACCA 


GTGGATCAAC 


GAGGACTGCT 


CCACGCCATG 


CTCCGGCTCG 
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4651 TGGCTAAGAG ATGTTTGGGA TTGGATATGC ACGGTGTTGA CTGATTTCAA 

4701 GACCTGGCTC CAGTCCAAGC TCCTGCCGCG ATTGCCGGGA GTCCCCTTCT 

4751 TCTCATGTCA ACGTGGGTAC AAGGGAGTCT GGCGGGGCGA CGGCATCATG 

4801 CAAACCACCT GCCCATGTGG AGCACAGATC ACCGGACATG TGAAAAACGG 

4851 TTCCATGAGG ATCGTGGGGC CTAGGACCTG TAGTAACACG TGGCATGGAA 

4901 CATTCCCCAT TAACGCGTAC ACCACGGGCC CCTGCACGCC CTCCCCGGCG 

4951 CCAAATTATT CTAGGGCGCT GTGGCGGGTG GCTGCTGAGG AGTACGTGGA 

5001 GGTTACGCGG GTGGGGGATT TCCACTACGT GACGGGCATG ACCACTGACA 

5051 ACGTAAAGTG CCCGTGTCAG GTTCCGGCCC CCGAATTCTT CACAGAAGTG 

5101 GATGGGGTGC GGTTGCACAG GTACGCTCCA GCGTGCAAAC CCCTCCTACG 

5151 GGAGGAGGTC ACATTCCTGG TCGGGCTCAA TCAATACCTG GTTGGGTCAC 

5201 AGCTCCCATG CGAGCCCGAA CCGGACGTAG CAGTGCTCAC TTCCATGCTC 

5251 ACCGACCCCT CCCACATTAC GGCGGAGACG GCTAAGCGTA. GGCTGGCCAG 

5301 GGGATCTCCC CCCTCCTTGG CCAGCTCATC AGCTAGCCAG CTGTCTGCGC 

5351 CTTCCTTGAA GGCAACATGC ACTACCCGTC ATGACTCCCC GGACGCTGAC 

54 01 CTCATCGAGG CCAACCTCCT GTGGCGGCAG GAGATGGGCG GGAACATCAC 

5451 CCGCGTGGAG TCAGAAAATA AGGTAGTAAT TTTGGACTCT TTCGAGCCGC 

5501 TCCAAGCGGA GGAGGATGAG AGGGAAGTAT CCGTTCCGGC GGAGATCCTG 

5551 CGGAGGTCCA GGAAATTCCC TCGAGCGATG CCCATATGGG CACGCCCGGA 

5601 TTACAACCCT CCACTGTTAG AGTCCTGGAA GGACCCGGAC TACGTCCCTC 

5651 CAGTGGTACA CGGGTGTCCA TTGCCGCCTG CCAAGGCCCC TCCGATACCA 

5701 CCTCCACGGA GGAAGAGGAC GGTTGTCCTG TCAGAATCTA CCGTGTCTTC 

5751 TGCCTTGGCG GAGCTCGCCA C AAAG ACCTT CGGCAGCTCC GAATCGTCGG 

5801 CCGTCGACAG CGGCACGGCA ACGGCCTCTC CTGACCAGCC CTCCGACGAC 

5851 GGCGACGCGG GATCCGACGT TGAGTCGTAC TCCTCCATGC CCCCCCTTGA 

5901 GGGGGAGCCG GGGGATCCCG ATCTCAGCGA CGGGTCTTGG TCTACCGTAA 

5951 GCGAGGAGGC TAGTGAGGAC GTCGTCTGCT GCTCGATGTC CTACACATGG 

6001 ACAGGCGCCC TGATCACGCC ATGCGCTGCG GAGGAAACCA AGCTGCCCAT 

6051 CAATGCACTG AGCAACTCTT TGCTCCGTCA CCACAACTTG GTCTATGCTA 

6101 CAACATCTCG CAGCGCAAGC CTGCGGCAGA AGAAGGTCAC CTTTGACAGA 

6151 CTGCAGGTCC TGGACGACCA CTACCGGGAC GTGCTCAAGG AGATGAAGGC 
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6201 


GAAGGCGTCC 


ACAGTTAAGG 


CTAAACTTCT 


ATCCGTGGAG 


GAAGCCTGTA 


6251 


AGCTGACGCC 


CCCACATTCG 


GCCAGATCTA AATTTGGCTA 


TGGGGCAAAG 


6301 


GACGTCCGGA 


ACCTATCCAG 


CAAGGCCGTT 


AACCACATCC 


GCTCCGTGTG 


6351 


GAAGGACTTG 


CTGGAAGACA 


CTGAGACACC 


AATTGACACC 


ACCATCATGG 


6401 


CAAAAAATGA 


GGTTTTCTGC 


GTCCAACCAG AGAAGGGGGG 


CCGCAAGCCA 


6451 


GCTCGCCTTA 


TCGTATTCCC 


AGATTTGGGG 


GTTCGTGTGT 


GCGAGAAAAT 


6501 


GGCCCTTTAC 


GATGTGGTCT 


CCACCCTCCC 


TCAGGCCGTG 


ATGGGCTCTT 


6551 


CATACGGATT 


CCAATACTCT 


CCTGGACAGC 


GGGTCGAGTT 


CCTGGTGAAT 


6601 


GCCTGGAAAG 


CGAAGAAATG 


CCCTATGGGC 


TTCGCATATG 


ACACCCGCTG 


6651 


TTTTGACTCA 


ACGGTCACTG 


AGAATGACAT 


CCGTGTTGAG 


GAGTCAATCT 


6701 


ACCAATGTTG 


TGACTTGGCC 


CCCGAAGCCA 


GACAGGCCAT 


AAGGTCGCTC 


6751 


ACAGAGCGGC 


TTTACATCGG 


GGGCCCCCTG 


ACTAATTCTA 


AAGGGCAGAA 


6801 


CTGCGGCTAT 


CGCCGGTGCC 


GCGCGAGCGG 


TGTACTGACG 


ACCAGCTGCG 


6851 


GTAATACCCT 


CACATGTTAC 


TTGAAGGCCG 


CTGCGGCCTG 


TCGAGCTGCG 


6901 


AAGCTCCAGG 


ACTGCACGAT 


GCTCGTATGC 


GGAGACGACC 


TTGTCGTTAT 


6951 


CTGTGAAAGC 


GCGGGGACCC 


AAGAGGACGA 


GGCGAGCCTA 


CGGGCCTTCA 


7001 


CGGAGGCTAT 


GACTAGATAC 


TCTGCCCCCC 


CTGGGGACCC 


GCCCAAACCA 


7051 


GAATACGACT 


TGGAGTTGAT 


AACATCATGC 


TCCTCCAATG 


TGTCAGTCGC 


7101 


GCACGATGCA 


TCTGGCAAAA 


GGGTGTACTA 


TCTCACCCGT 


GACCCCACCA 


7151 


CCCCCCTTGC 


GCGGGCTGCG 


TGGGAGACAG 


C TAG AC AC AC 


TCCAGTCAAT 


7201 


TCCTGGCTAG 


GCAACATCAT 


CATGTATGCG 


CCCACCTTGT 


GGGCAAGGAT 


7251 


GATCCTGATG 


ACTCATTTCT 


TCTCCATCCT 


TCTAGCTCAG 


GAACAACTTG 


7301 


AAAAAGCCCT 


AGATTGTCAG 


ATCTACGGGG 


CCTGTTACTC 


CATTGAGCCA 


7351 


CTTGACCTAC 


CTCAGATCAT 


TCAACGACTC 


CATGGCCTTA 


GCGCATTTTC 


7401 


ACTCCATAGT 


TACTCTCCAG 


GTGAGATCAA 


TAGGGTGGCT 


TCATGCCTCA 


7451 


GGAAACTTGG 


GGTACCGCCC 


TTGCGAGTCT 


GGAGACATCG 


GGCCAGAAGT 








CCAGGGGGGG 


AGGGCTGCCA 


LI Ibl VjVjC e\J\ 


7551 


GTACCTCTTC 


AACTGGGCAG 


TAAGGACCAA 


GCTCAAACTC 


ACTCCAATCC 


7601 


CGGCTGCGTC 


CCAGTTGGAT 


TTATCCAGCT 


GGTTCGTTGC 


TGGTTACAGC 


7651 


GGGGGAGACA 


TATATCACAG 


CCTGTCTCGT 


GCCCGACCCC 


GCTGGTTCAT 


7701 


GTGGTGCCTA 


CTCCTACTTT 


CTGTAGGGGT 


AGGCATCTAT 


CTACTCCCCA 
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7751 ACCGATGAAC GGGGAGCTAA ACACTCCAGG CCAATAGGCC ATCCTGTTTT 

7801 TTTCCCTTTT TTTTTTTCTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 

7851 TTCTCCTTTT TTTTTCCTCT TTTTTTCCTT TTCTTTCCTT TGGTGGCTCC 

7901 ATCTTAGCCC TAGTCACGGC TAGCTGTGAA AGGTCCGTGA GCCGCTTGAC 

7951 TGCAGAGAGT GCTGATACTG GCCTCTCTGC AGATCAAGTA CTTCTAGAGA 

8001 ATTCTAGCTT GGCGTAATCA TGGTCATAGC TGTTTCCTGT GTGAAATTGT 

8051 TATCAGCTCA CAATTCCACA CAACATACGA GCCGGAAGCA TAAAGTGTAA 

8101 AGCCTGGGAT GCCTAATGAG TGAGCTAACT CACATTAGTT GCGTTGCGCT 

8151 CACTGCCCGC TTTCCAGTCG GGAAACCTGT CGTGCCAGCT CCATTAGTGA 

8201 ATCGTCCAAC GCACGGGGAG AGGCGGTTTG CGTATTGGGC GCACTTCCGC 

8251 TTCCTCGCTC ACTGACTCGC TGCGCTCGTT CGTTCGGCTG CGGCGAGCCG 

8301 TATCAGCTCA CTCAAAGGCG GTAATACGGT TATCCACAGA ATCAGGGGAT 

8351 AACGCAGGAA AGACCATGTG AGCAAAAGGC CAGCAAAAGG CCAGGAACCG 

8401 TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG 

8451 AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGGCG AAA CCCGACAGGA 

8501 CTATAAAGAT ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC 

8551 TGTTCCGACC CTGCCGCTTA CCGGATACCT GTCCGCCTTT CTCCCTTCGG 

8601 GAAGCGTGGC GCTTTCTCAT AGCTCACGCT GTAGGTATCT CAGTTCGGTG 

8651 TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC CCGTTCAGCC 

87 01 CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA 

8751 GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA 

8801 GCGAGGTATG TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA 

8851 CGGCTACACT AGAAGGACAG TATTTGGTAT CTGCGCTCTG CTGAAGCCAG 

8901 TTACCTTCGG AAAAAGAGTT GGTAGCTCTT GATCCGGCAA ACAAACCACC 

8951 GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA CGCGCAGAAA 

9001 AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC 

9051 AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA 

9101 AGGATCTTCA CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT 

9151 CTAAAGTATA TATGAGTAAA CTTGGTCTGA CAGTTACCAA TGCTTAATCA 

92 01' GTGAGGCACC TATCTCAGCG ATCTGTCTAT TTCGTTCATC CATAGTTGCC 

92 51 TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT TACCATCTGG 



FIG. IF 



6/7 



WO 02/059321 PCT/EP02/00526 



9301 


CCCCAGTGCT 


GCAATGATAC 


CGCGAGAACC 


ACGCTCACCC 


GCACCAGATT 


9351 


TATCAGCAAT 


AAACCAGCCA 


GCCGGAAGTG 


CGCTGCGGAG 


AAGTGGTCCT 


9401 


GCAACTTTAT 


CCGCCTCCAT 


CCAGTCTATT 


AGTTGTTGCC 


GGGAAGCTAG 


9451 


AGTAAGTAGT 


TCGCCAGTCA 


GCAGTTTGCG 


TAACGTCGTT 


GCCATAGCAA 


9501 


CAGGCATCGT 


GGTGTCACGC 


TCGTCGTTTG 


GTATGGCTTC 


ATTCAGCTCC 


9551 


GGCTCCCAAC 


GATCAAGGCG 


AGTTACATGA 


TCCCCCATGT 


TGTGCAAAAA 


9601 


AGCGGTTAGC 


TCCTTCGGTC 


CTCCGATCGT TGTCAGAAGT 


AAGTTGGCCG 


9651 


CAGTGTTATC 


ACTCATGGTT 


ATGGCAGCAC 


TGCATAATTC 


TCTTACTGTC 


9701 


ATGCCATCCG 


TAAGATGCTT 


TTCTGTGACT 


GGTGAGTACT 


CAACCAAGTC 


9751 


ATTCTGAGAA 


TAGTGTATGC 


GGCGACCGAG 


TTGCTCTTGC 


CCGGCGTCAA 


9801 


TACGGGATAA 


TACCGCGCCA 


CATAGCAGAA 


CTTTAAAAGT 


GCTCATCATT 


9851 


GGAAAACGTT 


CTTCGGGGCG 


AAAACTCTCA AGGATCTTAC 


CGCTGTTGAG 


9901 


ATCCAGTTCG 


ATGTAACCCA 


CTCGTGCACC 


CAACTGATCT 


TCAGCATCTT 


9951 


TTACTTTCAC 


CAGCGTTTCT 


GGGTGAGCAA 


AAACAGGAAG 


GCAAAATGCC 


10001 


GCAAAAAAGG 


GAATAAGGGC 


GACACGGAAA 


TGTTGAATAC 


TCATACTCTT 


10051 


CCTTTTTCAA 


TATTATTGAA 


GCATTTATCA 


GGGTTATTGT 


CTCATGAGCG 


10101 


GATACATATT 


TGAATGTATT 


TAGAAAAATA 


AACAAATAGG 


GGTTCCGCGC 


10151 


ACATTTCCCC 


GAAAAGTGCC 


ACCTGACGTC 


TAAGAAACCA 


TTATTACCAT 


10201 


GACATTAACC 


TATAAAAATA 


GGCGTATCAC 


GAAGCCCTTT 


CGTCTAGCGC 


10251 


GTTTCGGTGA 


TGACGGTGAA 


AACCTCTGAC 


ACTTGCAGCT 


CCCGCAGACG 


10301 


GTCACAGCTT 


GTCTGTAAGC 


GGATGCCGGG 


AGCAGGCAAG 


CCCGTCAGGG 


10351 


CGCGTCAGTG 


GGTGTTGGCG 


GGTGTCGGGG 


CTGGCTTAAC 


TATGCGGCAT 


10401 


CAGAGCAGAT 


TGTACTGAGA 


GTACACCAGA 


TGCGGTGTGA 


AATACCGCAC 


10451 


AGATGCGTAA 


GGAGAAAATA 


CCGCATCAGC 


CTCCATTCGC 


CATTCAGACT 


10501 


CCGCAACTGT 


TGGGAAGGGC 


GGTCAGTACG 


CGCTTCTTCG 


CTATTACGCC 


10551 


AACTGGCGAA 


AGGGGGATGT 


GCTGCAAGGC 


GATTAAGTTG 


GGTAACGCCA 


10601 


GGGTTTTCCC 


AATCACGACG 


TTGTAAAACG 


ACAGCCAATG 


AATTGAAGCT 



10651 TATTAATTCT AGACTGAAGC ttttaatacg actcactata (SEQ. ED. NO.:3) 

Fig. 1G 
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SEQUENCE LISTING 

<110> Istituto Di Ricerche Di Biologia Molecolare P. Angeletti S.P.A. 

<120> HEPATITIS C VIRUS REPLICONS AND REPLICON 
ENHANCED CELLS 

<130> IT0003 PCT 

<150> 60/263,479 
<151> 2001-01-23 

<160> 13 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 3010 
<212> PRT 

<213> Con 1 HCV isolate nucleic acid 
<400> 1 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 

1 5 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 

20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 

35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 55 60 

lie Pro Lys Ala Arg Gin Pro Glu Gly Arg Ala Trp Ala Gin Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 

85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 

100 105 110 

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val lie Asp Thr Leu Thr Cys 

115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Ala Pro Leu 

130 135 140 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie 

165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr lie Pro Ala Ser Ala Tyr 

180 185 190 

Glu Val Arg Asn Val Ser Gly Val Tyr His Val Thr Asn Asp Cys Ser 

195 200 205 

Asn Ala Ser lie Val Tyr Glu Ala Ala Asp Met He Met His Thr Pro 

210 215 220 

Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val 
225 230 235 240 

Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr 

245 250 255 

Thr He Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Leu Cys 

260 265 270 

Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ala 

275 280 285 

Gin Leu Phe Thr Phe Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys 

290 295 300 

Asn Cys Ser He Tyr Pro Gly His Val Thr Gly His Arg Met Ala Trp 
305 310 315 320 

- 1 - 
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Asp Met Met Met Asn Trp Ser Pro Thr Ala Ala Leu Val Val Ser Gin 

325 330 335 

Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His 

340 345 350 

Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp 

355 360 365 

Ala Lys Val Leu He Val Met Leu Leu Phe Ala Gly Val Asp Gly Gly 

370 375 380 

Thr Tyr Val Thr Gly Gly Thr Met Ala Lys Asn Thr Leu Gly He Thr 
385 390 395 400 

Ser Leu Phe Ser Pro Gly Ser Ser Gin Lys He Gin Leu Val Asn Thr 

405 410 415 

Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 

420 425 430 

Leu Asn Thr Gly Phe Leu Ala Ala Leu Phe Tyr Val His Lys Phe Asn 

435 440 445 

Ser Ser Gly Cys Pro Glu Arg Met Ala Ser Cys Ser Pro He Asp Ala 

450 455 460 

Phe Ala Gin Gly Trp Gly Pro He Thr Tyr Asn Glu Ser His Ser Ser 
465 470 475 480 

Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He 

485 490 495 

Val Pro Ala Ala Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser 

500 505 510 

Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Ser 

515 520 525 

Trp Gly Glu Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr Arg Pro 

530 535 540 

Pro Gin Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe 
545 550 555 560 

Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly He Gly Asn 

565 570 575 

Lys Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala 

580 585 " 590 

Thr Tyr Thr Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Leu 

595 600 605 

Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe 

610 615 620 

Thr He Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu 
625 630 635 640 

Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asn Leu Glu Asp 

645 650 655 

Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp 

660 665 670 

Gin Val Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly 

675 680 685 

Leu He His Leu His Gin Asn Val Val Asp Val Gin Tyr Leu Tyr Gly 

690 695 700 

He Gly Ser Ala Val Val Ser Phe Ala He Lys Trp Glu Tyr Val Leu 
705 710 715 720 

Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp 

725 730 735 

Met Met Leu Leu He Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val 

740 745 750 

Val Leu Asn Ala Ala Ser Val Ala Gly Ala His Gly He Leu Ser Phe 

755 760 765 

Leu Val Phe Phe Cys Ala Ala Trp Tyr He Lys Gly Arg Leu Val Pro 

770 775 780 

Gly Ala Ala Tyr Ala Leu Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu 
785 790 795 800 

Leu Ala Leu Pro Pro Arg Ala Tyr Ala Met Asp Arg Glu Met Ala Ala 
805 810 815 
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Ser 


Cvs 


Gly Gly Ala 


Val 


Phe 


Val 


Gly Leu He 


Leu Leu 


Thr 


Leu 


Ser 






820 








825 




830 






Pro 


His 


Tvr Lvs Leu 


Phe 


Leu 


Ala 


Arg Leu He Trp Trp 


Leu 


Gin 


Tvr 
iy j. 






835 






840 




845 








Phe 


He 


Thr Arg Ala 


Glu 


Ala 


His 


Leu Gin Val 


Trp lie 


Pro 


Pro 


T.oi i 




850 






855 






860 








Asn 


Val 


niy uiy vjxy 


y 


A en 
nop 


Al a 


Val lie Leu 


Leu Thr 


\+ y o 


Al a 


Tip 


865 






870 






875 








Rfi 0 


His 


Pro 


Glu Leu Tie 

UiU UCU 11C 


Phe 


Thr 


lie 


Thr Lys lie 


Leu Leu 


Ala 


He 


ijcu 






OOJ 








890 






HQS 




Glv 


prrv 


T.en Mot 1 


uc u 


V7JLI1 


Al P 


Gly lie Thr Lys Val 


i J. KJ 


Tyr 


Phe 














905 




910 






v a x 


r\-L y 


Ala Ui c Pi v 

Hid ru.£» vjiy 


Leu 


Tip 

J. -Lfc; 


Arg 


Ala Cys Met 


Leu Val 


Arg 


Lys 


Val 






915 






920 




925 










Gly 


Gly His Tyr 


vai 


Gin 


Met 


Ala Leu Met 


Lys Leu 


Ala 


Ala 


Leu 




930 






935 






940 








Thr 


Gly 


Thr Tyr Val 


Tyr 


Asp 


His 


Leu Thr Pro 


Leu Arg 


Asp 


Trp 


Ala 


945 






950 






955 








960 


His 


Ala 


Gly Leu Arg 


Asp 


Leu 


Ala 


Val Ala Val 


Glu Pro 


Val 


Val 


Phe 






965 








970 






975 




Ser 


Asp 


Met Glu Thr 


Lys 


Val 


lie 


Thr Trp Gly Ala Asp 


Thr 


Ala 


Ala 






980 








985 




990 






Cys 


Gly 


Asp He He 


Leu 


Gly 


Leu 


Pro Val Ser 


Ala Arg 


Arg 


Gly Arg 






995 






1000 


1005 






Glu 


He 


His Leu Gly 


Pro 


Ala 


Asp 


Ser Leu Glu 


Gly Gin 


Gly 


Trp 


Arg 




1010 




1015 




1020 








Leu 


Leu 


Ala Pro lie 


Thr 


Ala 


Tyr 


Ser Gin Gin 


Thr Arg 


Gly 


Leu 


Leu 


1025 




1030 




1035 






1040 


Gly 


Cys 


lie lie Thr 


Ser 


Leu 


Thr 


Gly Arg Asp 


Arg Asn 


Gin 


Val 


Glu 






1045 






1050 






1055 


Gly 


Glu 


Val Gin Val 


Val 


Ser 


Thr 


Ala Thr Gin 


Ser Phe 


Leu 


Ala 


Thr 






1060 








1065 




1070 




Cys 


Val 


Asn Gly Val 


Cys 


Trp 


Thr 


Val Tyr His 


Gly Ala 


Gly 


Ser 


Lys 






1075 






1080 


1085 






Thr 


Leu 


Ala Gly Pro 


Lys 


Gly 


Pro 


lie Thr Gin 


Met Tyr 


Thr 


Asn 


Val 




1090 




1095 




1100 








Asp 


Gin 


Asp Leu Val 


Gly 


Trp 


Gin 


Ala Pro Pro 


Gly Ala 


Arg 


Ser 


Leu 


1105 




1110 




1115 






1120 


Thr 


Pro 


Cys Thr Cys 


Gly 


Ser 


Ser 


Asp Leu Tyr 


Leu Val 


Thr 


Arg 


His 






1125 






1130 






1135 


Ala 


Asp 


Val lie Pro 


Val 


Arg 


Arg 


Arg Gly Asp 


Ser Arg 


Gly 


Ser 


Leu 






1140 








1145 




1150 




Leu 


Ser 


Pro Arg Pro 


Val 


Ser 


Tyr 


Leu Lys Gly 


Ser Ser 


Gly 


Gly 


Pro 






1155 






1160 


1165 






Leu 


Leu 


Cys Pro Ser 


Gly 


His 


Ala 


Val Gly lie 


Phe Arg 


Ala 


Ala 


Val 




1170 




1175 




1180 








Cys 


Thr 


Arg Gly Val 


Ala 


Lys 


Ala 


Val Asp Phe 


Val Pro 


Val 


Glu 


Ser 


1185 




1190 




1195 






1200 


Met 


Glu 


Thr Thr Met 


Arg 


Ser 


Pro 


Val Phe Thr 


Asp Asn 


Ser 


Ser 


Pro 






1205 






1210 






1215 


Pro 


Ala 


Val Pro Gin 


Thr 


Phe 


Gin 


Val Ala His 


Leu His 


Ala 


Pro 


Thr 






1220 








1225 




1230 




Gly 


Ser 


Gly Lys Ser 


Thr 


Lys 


Val 


Pro Ala Ala 


Tyr Ala Ala 


Gin 


Gly 






1235 






1240 


1245 






Tyr 


Lys 


Val Leu Val 


Leu 


Asn 


Pro 


Ser Val Ala 


Ala Thr 


Leu 


Gly 


Phe 




1250 




1255 




1260 








Gly Ala 


Tyr Met Ser 


Lys 


Ala 


His Gly lie Asp 


Pro Asn 


lie 


Arg 


Thr 


1265 






1270 




1275 








1280 


Gly Val 


Arg Thr He 


Thr Thr Gly Ala Pro lie 


Thr Tyr 


Ser 


Thr 


Tyr 






1285 






1290 






1295 




Gly 


Lys 


Phe Leu Ala 


Asp Gly 


Gly Cys Ser Gly Gly Ala 


Tyr 


Asp 


lie 






1300 








1305 




1310 
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lie He Cys Asp Glu Cys His Ser Thr Asp Ser Thr Thr He Leu Gly 

1315 1320 1325 

He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val 

1330 1335 1340 

Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val Pro His Pro 
1345 1350 1355 1360 

Asn He Glu Glu Val Ala Leu Ser Ser Thr Gly Glu He Pro Phe Tyr 

1365 1370 1375 

Gly Lys Ala lie Pro He Glu Thr He Lys Gly Gly Arg His Leu lie 

1380 1385 1390 

Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Ser 

1395 1400 1405 

Gly Leu Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser 

1410 1415 1420 

Val He Pro Thr Ser Gly Asp Val He Val Val Ala Thr Asp Ala Leu 
1425 1430 1435 1440 

Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn Thr 

1445 1450 1455 

Cys Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr lie 

1460 1465 1470 

Glu Thr Thr Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg 

1475 1480 1485 

Gly Arg Thr Gly Arg Gly Arg Met Gly He Tyr Arg Phe Val Thr Pro 

1490 1495 1500 

Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys Glu Cys 
1505 1510 1515 1520 

Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Ser 

1525 1530 , 1535 

Val Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu Pro Val Cys Gin 

1540 1545 1550 

Asp His Leu Glu Phe Trp Glu Ser Val Phe Thr Gly Leu Thr His He 

1555 1560 1565 

Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly Asp Asn Phe Pro 

1570 1575 1580 

Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro 
1585 1590 1595 1600 

Pro Pro Ser Trp Asp Gin Met Trp Lys Cys Leu lie Arg Leu Lys Pro 

1605 1610 1615 

Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val Gin 

1620 1625 1630 

Asn Glu Val Thr Thr Thr His Pro lie Thr Lys Tyr He Met Ala Cys 

1635 1640 1645 

Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val Leu Val Gly 

1650 1655 1660 

Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val 
1665 1670 1675 1680 

Val lie Val Gly Arg lie lie Leu Ser Gly Lys Pro Ala lie lie Pro 

1685 1690 1695 

Asp Arg Glu Val Leu Tyr Arg Glu Phe Asp Glu Met Glu Glu Cys Ala 

1700 1705 1710 

Ser His Leu Pro Tyr lie Glu Gin Gly Met Gin Leu Ala Glu Gin Phe 

1715 1720 1725 

Lys Gin Lys Ala lie Gly Leu Leu Gin Thr Ala Thr Lys Gin Ala Glu 

1730 1735 1740 

Ala Ala Ala Pro Val Val Glu Ser Lys Trp Arg Thr Leu Glu Ala Phe 
1745 1750 1755 1760 

Trp Ala Lys His Met Trp Asn Phe He Ser Gly lie Gin Tyr Leu Ala 

1765 1770 1775 

Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala lie Ala Ser Leu Met Ala 

1780 1785 1790 

Phe Thr Ala Ser lie Thr Ser Pro Leu Thr Thr Gin His Thr Leu Leu 
1795 1800 1805 
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Phe Asn lie Leu Gly Gly Trp Val Ala Ala Gin Leu Ala Pro Pro Ser 

1810 1815 1820 

Ala Ala Ser Ala Phe Val Gly Ala Gly He Ala Gly Ala Ala Val Gly 
1825 1830 1835 1840 

Ser He Gly Leu Gly Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly 

1845 1850 1855 

Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys Val Met Ser Gly Glu 

1860 1865 1870 

Met Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro Ala He Leu Ser 

1875 1880 1885 

Pro Gly Ala Leu Val Val Gly Val Val Cys Ala Ala He Leu Arg Arg 

1890 1895 1900 

His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He 
1905 1910 1915 1920 

Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val Pro 

1925 1930 1935 

Glu Ser Asp Ala Ala Ala Arg Val Thr Gin He Leu Ser Ser Leu Thr 

1940 1945 1950 

He Thr Gin Leu Leu Lys Arg Leu His Gin Trp He Asn Glu Asp Cys 

1955 1960 1965 

Ser Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He 

1970 1975 1980 

Cys Thr Val Leu Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu 
1985 1990 1995 - 2000 

Pro Arg Leu Pro Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys 

2005 2010 2015 

Gly Val Trp Arg Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly 

2020 2025 2030 

Ala Gin He Thr Gly His Val Lys Asn Gly Ser Met Arg He Val Gly 

2035 2040 2045 

Pro Arg Thr Cys Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala 

2050 2055 2060 

Tyr Thr Thr Gly Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg 
2065 2070 2075 2080 

Ala Leu Trp Arg Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val 

2085 2090 2095 

Gly Asp Phe His Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys 

2100 2105 2110 

Pro Cys Gin Val Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val 

2115 2120 2125 

Arg Leu His Arg Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu 

2130 2135 2140 

Val Thr Phe Leu Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu 
2145 2150 2155 2160 

Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr 

2165 2170 2175 

Asp Pro Ser His lie Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg 

2180 2185 2190 

Gly Ser Pro Pro Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala 

2195 2200 2205 

Pro Ser Leu Lys Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala 

2210 2215 2220 

Asp Leu He Glu Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn 
2225 2230 2235 2240 

He Thr Arg Val Glu Ser Glu Asn Lys Val Val lie Leu Asp Ser Phe 

2245 2250 2255 

Glu Pro Leu Gin Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala 

2260 2265 2270 

Glu He Leu Arg Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp 

2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro 
2290 2295 2300 
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Asp Tyr Val Pro Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys 
2305 2310 2315 2320 

Ala Pro Pro lie Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser 

2325 2330 2335 

Glu Ser Thr Val Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe 

2340 2345 2350 

Gly Ser Ser Glu Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser 

2355 2360 2365 

Pro Asp Gin Pro Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser 

2370 2375 2380 

Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu 
2385 2390 2395 2400 

Ser Asp Gly Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val 

2405 2410 2415 

Val Cys Cys Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu lie Thr Pro 

2420 2425 2430 

Cys Ala Ala Glu Glu Thr Lys Leu Pro lie Asn Ala Leu Ser Asn Ser 

2435 2440 2445 

Leu Leu Arg His His Asn Leu Val Tyr Ala Thr Thr Ser Arg Ser Ala 

2450 2455 2460 

Ser Leu Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp 
2465 2470 2475 2480 

Asp His Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Ala Ser Thr 

2485 2490 2495 

Val Lys Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro 

2500 2505 2510 

Pro His Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg 

2515 2520 2525 

Asn Leu Ser Ser Lys Ala Val Asn His He Arg Ser Val Trp Lys Asp 

2530 2535 2540 

Leu Leu Glu Asp Thr Glu Thr Pro He Asp Thr Thr He Met Ala Lys 
2545 2550 2555 2560 

Asn Glu Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala 

2565 2570 2575 

Arg Leu He Val Phe Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met 

2580 2585 2590 

Ala Leu Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser 

2595 2600 2605 

Ser Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val 

2610 2615 2620 

Asn Ala Trp Lys Ala Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr 
2625 2630 2635 ~ 2640 

Arg Cys Phe Asp Ser Thr Val Thr Glu Asn Asp lie Arg Val Glu Glu 

2645 2650 2655 

Ser lie Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala lie 

2660 2665 2670 

Arg Ser Leu Thr Glu Arg Leu Tyr lie Gly Gly Pro Leu Thr Asn Ser 

2675 2680 2685 

Lys Gly Gin Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu 

2690 2695 2700 

Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ala Ala 
2705 2710 2715 2720 

Ala Cys Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly 

2725 2730 2735 

Asp Asp Leu Val Val lie Cys Glu Ser Ala Gly Thr Gin Glu Asp Glu 

2740 2745 2750 

Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro 

2755 2760 2765 

Pro Gly Asp Pro Pro Lys Pro Glu Tyr Asp Leu Glu Leu lie Thr Ser 

2770 2775 2780 

Cys Ser Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val 
2785 2790 2795 2800 
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Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp 

2805 2810 2815 

Glu Thr Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn He He 

2820 2825 2830 

Met Tyr Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe 

2835 2840 2845 

Phe Ser He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys 

2850 2855 2860 

Gin He Tyr Gly Ala Cys Tyr Ser lie Glu Pro Leu Asp Leu Pro Gin 
2865 2870 2875 2880 

He He Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr 

2885 2890 2895 

Ser Pro Gly Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly 

2900 2905 2910 

Val Pro Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala 

2915 2920 2925 

Arg Leu Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu 

2930 2935 2940 

Phe Asn Trp Ala Val Arg Thr Lys Leu Lys Leu Thr Pro He Pro Ala 
2945 2950 2955 2960 

Ala Ser Gin Leu Asp Leu Ser Ser Trp Phe Val Ala Gly Tyr Ser Gly 

2965 2970 2975 

Gly Asp lie Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Met 

2980 2985 2990 

Trp Cys Leu Leu Leu Leu Ser Val Gly Val Gly lie Tyr Leu Leu Pro 
2995 3000 3005 

Asn Arg 
3010 

<210> 2 
<211> 9605 
<212> DNA 

<213> Con 1 HCV isolate amino acid 
<400> 2 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 

tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 

cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 

gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 

gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 

gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 

ctcaaagaaa aaccaaacgt aacaccaacc gccgcccaca ggacgtcaag ttcccgggcg 42 0 

gtggtcagat cgtcggtgga gtttacctgt tgccgcgcag gggccccagg ttgggtgtgc 480 

gcgcgactag gaagacttcc gagcggtcgc aacctcgtgg aaggcgacaa cctatcccca 54 0 

aggctcgcca gcccgagggt agggcctggg ctcagcccgg gtacccctgg cccctctatg 600 

gcaatgaggg cttggggtgg gcaggatggc tcctgtcacc ccgtggctct cggcctagtt 660 

ggggccccac ggacccccgg cgtaggtcgc gcaatttggg taaggtcatc gataccctca 720 

cgtgcggctt cgccgatctc atggggtaca ttccgctcgt cggcgccccc ctagggggcg 780 

ctgccagggc cctggcgcat ggcgtccggg ttctggagga cggcgtgaac tatgcaacag 840 

ggaatctgcc cggttgctcc ttttctatct tccttttggc tttgctgtcc tgtttgacca 900 

tcccagcttc cgcttatgaa gtgcgcaacg tatccggagt gtaccatgtc acgaacgact 960 

gctccaacgc aagcattgtg tatgaggcag cggacatgat catgcatacc cccgggtgcg 1020 

tgccctgcgt tcgggagaac aactcctccc gctgctgggt agcgctcact cccacgctcg 1080 

cggccaggaa cgctagcgtc cccactacga cgatacgacg ccatgtcgat ttgctcgttg 1140 

gggcggctgc tctctgctcc gctatgtacg tgggagatct ctgcggatct gttttcctcg 1200 

tcgcccagct gttcaccttc tcgcctcgcc ggcacgagac agtacaggac tgcaattgct 1260 

caatatatcc cggccacgtg acaggtcacc gtatggcttg ggatatgatg atgaactggt 1320 

cacctacagc agccctagtg gtatcgcagt tactccggat cccacaagct gtcgtggata 1380 

tggtggcggg ggcccattgg ggagtcctag cgggccttgc ctactactcc atggtgggga 1440 

actgggctaa ggttctgatt gtgatgctac tctttgccgg cgttgacggg ggaacctatg 1500 

tgacaggggg gacgatggcc aaaaacaccc tcgggattac gtccctcttt tcacccgggt 1560 

catcccagaa aatccagctt gtaaacacca acggcagctg gcacatcaac aggactgccc 1620 
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tgaactgcaa tgactccctc aacactgggt tccttgctgc gctgttctac gtgcacaagt 1680 

tcaactcatc tggatgccca gagcgcatgg ccagctgcag ccccatcgac gcgttcgctc 1740 

aggggtgggg gcccatcact tacaatgagt cacacagctc ggaccagagg ccttattgtt 1800 

ggcactacgc accccggccg tgcggtatcg tacccgcggc gcaggtgtgt ggtccagtgt 1860 

actgcttcac cccaagccct gtcgtggtgg ggacgaccga ccggttcggc gtccctacgt 1920 

acagttgggg ggagaatgag acggacgtgc tgcttcttaa caacacgcgg ccgccgcaag 1980 

gcaactggtt tggctgtaca tggatgaata gcactgggtt caccaagacg tgcgggggcc 2040 

ccccgtgtaa catcgggggg atcggcaata aaaccttgac ctgccccacg gactgcttcc 2100 

ggaagcaccc cgaggccact tacaccaagt gtggttcggg gccttggttg acacccagat 2160 

gcttggtcca ctacccatac aggctttggc actacccctg cactgtcaac tttaccatct 2220 

tcaaggttag gatgtacgtg gggggagtgg agcacaggct cgaagccgca tgcaattgga 2280 

ctcgaggaga gcgttgtaac ctggaggaca gggacagatc agagcttagc ccgctgctgc 2340 

tgtctacaac ggagtggcag gtattgccct gttccttcac caccctaccg gctctgtcca 2400 

ctggtttgat ccatctccat cagaacgtcg tggacgtaca atacctgtac ggtatagggt 2460 

cggcggttgt ctcctttgca atcaaatggg agtatgtcct gttgctcttc cttcttctgg 2520 

cggacgcgcg cgtctgtgcc tgcttgtgga tgatgctgct gatagctcaa gctgaggccg 2580 

ccctagagaa cctggtggtc ctcaacgcgg catccgtggc cggggcgcat ggcattctct 2640 

ccttcctcgt gttcttctgt gctgcctggt acatcaaggg caggctggtc cctggggcgg 27 00 

catatgccct ctacggcgta tggccgctac tcctgctcct gctggcgtta ccaccacgag 2760 

catacgccat ggaccgggag atggcagcat cgtgcggagg cgcggttttc gtaggtctga 2820 

tactcttgac cttgtcaccg cactataagc tgttcctcgc taggctcata tggtggttac 2880 

aatattttat caccagggcc gaggcacact tgcaagtgtg gatccccccc ctcaacgttc 2940 

gggggggccg cgatgccgtc atcctcctca cgtgcgcgat ccacccagag ctaatcttta 3000 

ccatcaccaa aatcttgctc gccatactcg gtccactcat ggtgctccag gctggtataa 3060 

ccaaagtgcc gtacttcgtg cgcgcacacg ggctcattcg tgcatgcatg ctggtgcgga 3120 

aggttgctgg gggtcattat gtccaaatgg ctctcatgaa gttggccgca ctgacaggta 3180 

cgtacgttta tgaccatctc accccactgc gggactgggc ccacgcgggc ctacgagacc 3240 

ttgcggtggc agttgagccc gtcgtcttct ctgatatgga gaccaaggtt atcacctggg 3300 

gggcagacac cgcggcgtgt ggggacatca tcttgggcct gcccgtctcc gcccgcaggg 33 60 

ggagggagat acatctggga ccggcagaca gccttgaagg gcaggggtgg cgactcctcg 3420 

cgcctattac ggcctactcc caacagacgc gaggcctzact tggctgcatc atcactagcc 3480 

tcacaggccg ggacaggaac caggtcgagg gggaggtcca agtggtctcc accgcaacac 3540 

aatctttcct ggcgacctgc gtcaatggcg tgtgttggac tgtctatcat ggtgccggct 3600 

caaagaccct tgccggccca aagggcccaa tcacccaaat gtacaccaat gtggaccagg 3660 

acctcgtcgg ctggcaagcg ccccccgggg cgcgttcctt gacaccatgc acctgcggca 3720 

gctcggacct ttacttggtc acgaggcatg ccgatgtcat tccggtgcgc cggcggggcg 3780 

acagcagggg gagcctactc tcccccaggc ccgtctccta cttgaagggc tcttcgggcg 3840 

gtccactgct ctgcccctcg gggcacgctg tgggcatctt tcgggctgcc gtgtgcaccc 3900 

gaggggttgc gaaggcggtg gactttgtac ccgtcgagtc tatggaaacc actatgcggt 3960 

ccccggtctt cacggacaac tcgtcccctc cggccgtacc gcagacattc caggtggccc 4020 

atctacacgc ccctactggt agcggcaaga gcactaaggt gccggctgcg tatgcagccc 4080 

aagggtataa ggtgcttgtc ctgaacccgt ccgtcgccgc caccctaggt ttcggggcgt 4140 

atatgtctaa ggcacatggt atcgacccta acatcagaac cggggtaagg accatcacca 4200 

cgggtgcccc catcacgtac tccacctatg gcaagtttct tgccgacggt ggttgctctg 4260 

ggggcgccta tgacatcata atatgtgatg agtgccactc aactgactcg accactatcc 4320 

tgggcatcgg cacagtcctg gaccaagcgg agacggctgg agcgcgactc gtcgtgctcg 4380 

ccaccgctac gcctccggga tcggtcaccg tgccacatcc aaacatcgag gaggtggctc 4440 

tgtccagcac tggagaaatc ccctcttatg gcaaagccat ccccatcgag accatcaagg 4500 

gggggaggca cctcattttc tgccattcca agaagaaatg tgatgagctc gccgcgaagc 4560 

tgtccggcct cggactcaat gctgtagcat attaccgggg ccttgatgta tccgtcatac 4620 

caactagcgg agacgtcatt gtcgtagcaa cggacgctct aatgacgggc tttaccggcg 4680 

atttcgactc agtgatcgac tgcaatacat gtgtcaccca gacagtcgac ttcagcctgg 4740 

acccgacctt caccattgag acgacgaccg tgccacaaga cgcggtgtca cgctcgcagc 4800 

ggcgaggcag gactggtagg ggcaggatgg gcatttacag gtttgtgact ccaggagaac 4860 

ggccctcggg catgttcgat tcctcggttc tgtgcgagtg ctatgacgcg ggctgtgctt 4920 

ggtacgagct cacgcccgcc gagacctcag ttaggttgcg ggcttaccta aacacaccag 4980 

ggttgcccgt ctgccaggac catctggagt tctgggagag cgtctttaca ggcctcaccc 5040 

acatagacgc ccatttcttg tcccagacta agcaggcagg agacaacttc ccctacctgg 5100 

tagcatacca ggctacggtg tgcgccaggg ctcaggctcc acccccatcg tgggaccaaa 5160 

tgtggaagtg tctcatacgg ctaaagccta cgctgcacgg gccaacgccc ctgctgtata 5220 

ggctgggagc cgttcaaaac gaggttacta ccacacaccc cataaccaaa tacatcatgg 5280 

catgcatgtc ggctgacctg gaggtcgtca cgagcacctg ggtgccggCa ggcggagtcc 5340 
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tagcagctct ggccgcgtat tgcctgacaa caggcagcgt ggtcattgtg ggcaggatca 5400 
tcttgtccgg aaagccggcc atcattcccg acagggaagt cctttaccgg gagttcgatg 5460 
agatggaaga gtgcgcctca cacctccctt acatcgaaca gggaatgcag ctcgccgaac 5520 
aattcaaaca gaaggcaatc gggttgctgc aaacagccac caagcaagcg gaggctgctg 5580 
ctcccgtggt ggaatccaag tggcggaccc tcgaagcctt ctgggcgaag catatgtgga 5640 
atttcatcag cgggatacaa tatttagcag gcttgtccac tctgcctggc aaccccgcga 5700 
tagcatcact gatggcattc acagcctcta tcaccagccc gctcaccacc caacataccc 5760 

tcctgtttaa catcctgggg ggatgggtgg ccgcccaact tgctcctccc agcgctgctt 5820 

ctgctttcgt aggcgccggc atcgctggag cggctgttgg cagcataggc cttgggaagg 5880 

tgcttgtgga tattttggca ggttatggag caggggtggc aggcgcgctc gtggccttta 5940 

aggtcatgag cggcgagatg ccctccaccg aggacctggt taacctactc cctgctatcc 6000 

tctcccctgg cgccctagtc gtcggggtcg tgtgcgcagc gatactgcgt cggcacgtgg 6060 

gcccagggga gggggctgtg cagtggatga accggctgat agcgttcgct tcgcggggta 6120 

accacgtctc ccccacgcac tatgtgcctg agagcgacgc tgcagcacgt gtcactcaga 6180 

tcctctctag tcttaccatc actcagctgc tgaagaggct tcaccagtgg atcaacgagg 6240 

actgctccac gccatgctcc ggctcgtggc taagagatgt ttgggattgg atatgcacgg 6300 

tgttgactga tttcaagacc tggctccagt ccaagctcct gccgcgattg ccgggagtcc 6360 

ccttcttctc atgtcaacgt gggtacaagg gagtctggcg gggcgacggc atcatgcaaa 6420 

ccacctgccc atgtggagca cagatcaccg gacatgtgaa aaacggttcc atgaggatcg 6480 

tggggcctag gacctgtagt aacacgtggc atggaacatt ccccattaac gcgtacacca 6540 

cgggcccctg cacgccctcc ccggcgccaa attattctag ggcgctgtgg cgggtggctg 6600 

ctgaggagta cgtggaggtt acgcgggtgg gggatttcca ctacgtgacg ggcatgacca 6660 

ctgacaacgt aaagtgcccg tgtcaggttc cggcccccga attcttcaca gaagtggatg 6720 

gggtgcggtt gcacaggtac gctccagcgt gcaaacccct cctacgggag gaggtcacat 6780 

tcctggtcgg gctcaatcaa tacctggttg ggtcacagct cccatgcgag cccgaaccgg 6840 

acgtagcagt gctcacttcc atgctcaccg acccctccca cattacggcg gagacggcta 6900 

agcgtaggct ggccagggga tctcccccct ccttggccag ctcatcagct agccagctgt 6960 

ctgcgccttc cttgaaggca acatgcacta cccgtcatga ctccccggac gctgacctca 7020 

tcgaggccaa cctcctgtgg cggcaggaga tgggcgggaa catcacccgc gtggagtcag 7080 

aaaataaggt agtaattttg gactctttcg agccgctcca agcggaggag gatgagaggg 7140 

aagtatccgt tccggcggag atcctgcgga ggtccaggaa attccctcga gcgatgccca 7200 

tatgggcacg cccggattac aaccctccac tgttagagtc ctggaaggac ccggactacg 7260 

tccctccagt ggtacacggg tgtccattgc cgcctgccaa ggcccctccg ataccacctc 7320 

cacggaggaa gaggacggtt gtcctgtcag aatctaccgt gtcttctgcc ttggcggagc 7380 

tcgccacaaa gaccttcggc agctccgaat cgtcggccgt cgacagcggc acggcaacgg 7440 

cctctcctga ccagccctcc gacgacggcg acgcgggatc cgacgttgag tcgtactcct 7500 

ccatgccccc ccttgagggg gagccggggg atcccgatct cagcgacggg tcttggtcta 7560 

ccgtaagcga ggaggctagt gaggacgtcg tctgctgctc gatgtcctac acatggacag 7620 

gcgccctgat cacgccatgc gctgcggagg aaaccaagct gcccatcaat gcactgagca 7680 

actctttgct ccgtcaccac aacttggtct atgctacaac atctcgcagc gcaagcctgc 7740 

ggcagaagaa ggtcaccttt gacagactgc aggtcctgga cgaccactac cgggacgtgc 7800 

tcaaggagat gaaggcgaag gcgtccacag ttaaggctaa acttctatcc gtggaggaag 7860 

cctgtaagct gacgccccca cattcggcca gatctaaatt tggctatggg gcaaaggacg 7920 

tccggaacct atccagcaag gccgttaacc acatccgctc cgtgtggaag gacttgctgg 7980 

aagacactga gacaccaatt gacaccacca tcatggcaaa aaatgaggtt ttctgcgtcc 8040 

aaccagagaa ggggggccgc aagccagctc gccttatcgt attcccagat ttgggggttc 8100 

gtgtgtgcga gaaaatggcc ctttacgatg tggtctccac cctccctcag gccgtgatgg 8160 

gctcttcata cggaCtccaa tactctcctg gacagcgggt cgagttcctg gtgaatgcct 8220 

ggaaagcgaa gaaatgccct atgggcttcg catatgacac ccgctgtttt gactcaacgg 8280 

tcactgagaa tgacatccgt gttgaggagt caatctacca atgttgtgac ttggcccccg 8340 

aagccagaca ggccataagg tcgctcacag agcggcttta catcgggggc cccctgacta 8400 

attctaaagg gcagaactgc ggctatcgcc ggtgccgcgc gagcggtgta ctgacgacca 8460 

gctgcggtaa taccctcaca tgttacttga aggccgctgc ggcctgtcga gctgcgaagc 8520 

tccaggactg cacgatgctc gtatgcggag acgaccttgt cgttatctgt gaaagcgcgg 8580 

ggacccaaga ggacgaggcg agcctacggg ccttcacgga ggctatgact agatactctg 8640 

ccccccctgg ggacccgccc aaaccagaat acgacttgga gttgataaca tcatgctcct 8700 

ccaatgtgtc agtcgcgcac gatgcatctg gcaaaagggt gtactatctc acccgtgacc 8760 

ccaccacccc ccttgcgcgg gctgcgtggg agacagctag acacactcca gtcaattcct 8820 

ggctaggcaa catcatcatg tatgcgccca ccttgtgggc aaggatgatc ctgatgactc 8880 

atttcttctc catccttcta gctcaggaac aacttgaaaa agccctagat tgtcagatct 8940 

acggggcctg ttactccatt gagccacttg acctacctca gatcattcaa cgactccatg 9000 

gccttagcgc at 1 1 tcactc catagttact ctccaggtga gatcaatagg gtggctztcat 9060 



-9- 



WO 02/059321 



PCT/EP02/00526 



gcctcaggaa acttggggta ccgcccttgc gagtctggag acatcgggcc agaagtgtcc 9120 

gcgctaggct actgtcccag ggggggaggg ctgccacttg tggcaagtac ctcttcaact 9180 

gggcagtaag gaccaagctc aaactcactc caatcccggc tgcgtcccag ttggatttat 9240 

ccagctggtt cgttgctggt tacagcgggg gagacatata tcacagcctg tctcgtgccc 9300 

gaccccgctg gttcatgtgg tgcctactcc tactttctgt aggggtaggc atctatctac 9360 

tccccaaccg atgaacgggg agctaaacac tccaggccaa taggccatcc tgtttttttc 9420 

cctttttttt tttctttttt tttttttttt tttttttttt ttttttttct cctttttttt 9480 

tcctcttttt ttccttttct ttcctttggt ggctccatct tagccctagt cacggctagc 9540 

tgtgaaaggt ccgtgagccg cttgactgca gagagtgctg atactggcct ctctgcagat 9600 

caagt ~ 9605 



<210> 3 
<211> 10690 
<212> DNA 

<213> pHCVNeo.17 coding 
<400> 3 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 

tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 

cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 

gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 

gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 

gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 

ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 

cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 

ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 

acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 

cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 

tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 

aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 

cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 

ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 

ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 

gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 

tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 

ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 

agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 

gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 

cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 

ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 

aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 

gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500- 

aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 

gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 

tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 

atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 

aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 

atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 

agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 1920 

acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980 

ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 

caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 2100 

ggcagctcgg acctttactt ggtcacgagg catgccgatg tcattccggt gcgccggcgg 2160 

ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 2220 

ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2280 

acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 2340 

cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 2400 

gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 2460 

gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 2520 

gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 2580 
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accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 

tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 

atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 

ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 

gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 2880 

aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 

aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 

ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 

ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 

ctggacccga- ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 

cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 

gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 

gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 33 60 

ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 

acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 3480 

ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 

caaatgtgga agtgtctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 3600 

tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660 

atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 

gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 

atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 384 0 

gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 3 900 

gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 

gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4020 

tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4080 

gcgatagcat cactgatggc attcacagcc tctatcacca gcccgctcac cacccaacat 4140 

accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4200 

gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4260 

aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4320 

tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 4380 

atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 444 0 

gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 4500 

ggtaaccacg tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 4560 

cagatcctct ctagtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 4620 

gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc 4680 

acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 4740 

gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 4800 

caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 4860 

atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taacgcgtac 4920 

accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 4980 

gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5040 

accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5100 

gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5160 

acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5220 

ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5280 

gctaagcgta ggctggccag gggatctccc ccctccttgg ccagctcatc agctagccag 5340 

ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 5400 

ctcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 54 60 

tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 5520 

agggaagtat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg 5580 

cccatatggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 5640 

tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca 5700 

cctccacgga ggaagaggac ggttgtcctg tcagaatzcta ccgtgtcttc tgccttggcg 5760 

gagctcgcca caaagacctt cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca 5820 

acggcctctc ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac 5880 

tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 5940 

tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg 6000 

acaggcgccc tgatcacgcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6060 

agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6120 

ctgcggcaga agaaggtcac ctttgacaga ctgcaggtcc tggacgacca ctaccgggac 6180 

gtgctcaagg agacgaaggc gaaggcgtcc acagttaagg ctaaacttct atccgtggag 6240 

gaagcctgta agctgacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6300 
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gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 63 60 

ctggaagaca ctgagacacc aattgacacc accatcatgg caaaaaatga ggttttctgc 6420 

gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 6480 

gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggccgtg 6540 

atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctggtgaat 6600 

gcctggaaag cgaagaaatg ccctatgggc ttcgcatatg acacccgctg ttttgactca 6660 

acggtcactg agaatgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 6720 

cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 6780 

actaattcta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 6840 

accagctgcg gtaataccct cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg 6900 

aagctccagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 6960 

gcggggaccc aagaggacga ggcgagccta cgggccttca cggaggctat gactagatac 7020 

tctgcccccc ctggggaccc gcccaaacca gaatacgact tggagttgat aacatcatgc 7080 

tcctccaatg tgtcagtcgc gcacgatgca tctggcaaaa gggtgtacta tctcacccgt 7140 

gaccccacca ccccccttgc gcgggctgcg tgggagacag ctagacacac tccagtcaat 7200 

tcctggctag gcaacatcat catgtatgcg cccaccttgt gggcaaggat gatcctgatg 7260 

actcatttct tctccatcct tctagctcag gaacaacttg aaaaagccct agattgtcag 7320 

atctacgggg cctgttactc cattgagcca cttgacctac ctcagatcat tcaacgactc 7380 

catggcctta gcgcattttc actccatagt tactctccag gtgagatcaa tagggtggct 7440 

tcatgcctca ggaaacttgg ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt 7500 

gtccgcgcta ggctactgtc ccaggggggg agggctgcca cttgtggcaa gtacctcttc 7560 

aactgggcag taaggaccaa gctcaaactc actccaatcc cggctgcgtc ccagttggat 7620 

ttatccagct ggttcgttgc tggttacagc gggggagaca tatatcacag cctgtctcgt 7680 

gcccgacccc gctggttcat gtggtgccta ctcctacttt ctgtaggggt aggcatctat 7740 

ctactcccca accgatgaac ggggagctaa acactccagg ccaataggcc atcctgtttt 7800 

tttccctttt tttttttctt tttttttttt tttttttttt tttttttttt ttctcctttt 7860 

tttttcctct ttttttcctt ttctttcctt tggtggctcc atcttagccc tagtcacggc 7920 

tagctgtgaa aggtccgtga gccgcttgac tgcagagagt gctgatactg gcctctctgc 7980 

agatcaagta cttctagaga attctagctt ggcgtaatca tggtcatagc tgtttcctgt 8040 

gtgaaattgt tatcagctca caattccaca caacatacga gccggaagca taaagtgtaa 8100 

agcctgggat gcctaatgag tgagctaact cacattagtt gcgttgcgct cactgcccgc 8160 

tttccagtcg ggaaacctgt cgtgccagct ccattagtga atcgtccaac gcacggggag 8220 

aggcggtttg cgtattgggc gcacttccgc ttcctcgctc actgactcgc tgcgctcgtt 8280 

cgttcggctg cggcgagccg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 8340 

atcaggggat aacgcaggaa agaccatgtg agcaaaaggc cagcaaaagg ccaggaaccg 84 00 

taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 8460 

aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 8520 

tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 8580 

gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 8640 

cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 8700 

cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 8760 

atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 8820 

tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat 8880 

ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 8940 

acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 9000 

aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 9060 

aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 9120 

tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 9180 

cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 9240 

catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 9300 

ccccagtgct gcaatgatac cgcgagaacc acgctcaccc gcaccagatt tatcagcaat 9360 

aaaccagcca gccggaagtg cgctgcggag aagtggtcct gcaactttat ccgcctccat 9420 

ccagtctatt agttgttgcc gggaagctag agtaagtagt tcgccagtca gcagtttgcg 9480 

taacgtcgtt gccatagcaa caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 9540 

attcagctcc ggctcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 9600 

agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 9660 

actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 9720 

ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 9780 

ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 9840 

gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 9900 

atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 9960 

cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 10020 
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gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 10080 

gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 10140 

ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattaccat 10200 

gacattaacc tataaaaata ggcgtatcac gaagcccttt cgtctagcgc gtttcggtga 10260 

tgacggtgaa aacctctgac acttgcagct cccgcagacg gtcacagctt gtctgtaagc 10320 

ggatgccggg agcaggcaag cccgtcaggg cgcgtcagtg ggtgttggcg ggtgtcgggg 10380 

ctggcttaac tatgcggcat cagagcagat tgtactgaga gtacaccaga tgcggtgtga 10440 

aataccgcac agatgcgtaa ggagaaaata ccgcatcagc ctccattcgc cattcagact 10500 

ccgcaactgt tgggaagggc ggtcagtacg cgcttcttcg ctattacgcc aactggcgaa 10560 

agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc aatcacgacg 10620 

ttgtaaaacg acagccaatg aattgaagct tattaattct agactgaagc ttttaatacg 10680 

actcactata 10690 



<210> 4 
<211> 23 
<212> DNA 

<213> Primer oligonucleotide 
<400> 4 

acatgatctg cagagaggcc agt 23 

<210> 5 
<211> 26 
<212> DNA 

<213> Primer oligonucleotide 
<400> 5 

gacasgctgt gatawatgtc tccccc 26 

<210> 6 
<211> 21 
<212> DNA 

<213> Primer oligonucleotide 
<400> 6 

tggctctcct caagcgtatt c 21 

<210> 7 
<211> 23 
<212> DNA 

<213> Primer oligonucleotide 
<400> 7 

actctctgca gtcaagcggc tea 23 

<210> 8 
<211> 21 
<212> DNA 

<213> Primer oligonucleotide 
<400> 8 

cagtggatga aceggctgat a 21 

<210> 9 
<211> 23 
<212> DNA 

<213> Primer oligonucleotide 
<400> 9 

ggggegaegg catcatgeaa acc 23 
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<210> 10 
<211> 23 
<212> DNA 

<213> Primer oligonucleotide 
<400> 10 

caggacctgc agtctgtcaa agg 23 

<210> 11 
<211> 17 
<212> DNA 

<213> Primer oligonucleotide 
<400> 11 

cgggagagcc atagtgg 17 

<210> 12 
<211> 19 
<212> DNA 

<213> Primer oligonucleotide 
<400> 12 

agtaccacaa ggcctttcg 19 

<210> 13 
<211> 21 
<212> DNA 
<213> Probe 



<400> 13 

ctgcggaacc ggtgagtaca c 
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